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HUMAN ANGIOMOTIN-LIKE PROTEIN 1 
CROSS REFERENCE TO RELATED APPLICATIONS 

5 This application claims priority under 3 5 

U.S.C. § 120 to United States provisional application 
serial no. 60/334,773, filed November 1, 2001; the 
disclosure of which is incorporated herein by reference 
in its entireties. 

10 

REFERENCE TO SEQUENCE LISTING SUBMITTED ON COMPACT DISC 

The present application includes a Sequence 
Listing filed on a single CD-R disc, provided in 
15 duplicate, containing a single file named pto_PB0184.txt, 
having 189 kilobytes, last modified on October 9, 2002 
and recorded October 17, 2002. The Sequence Listing 
contained in said file on said disc is incorporated 
herein by reference in its entirety. 

20 

FIELD OF THE INVENTION 

The present invention relates to novel human 
angiomotin-like protein 1 (AMLP1) , including two 

25 isoforms. More specifically, the invention provides 
isolated nucleic acid molecules of AMLP1, fragments 
thereof, vectors and host cells comprising isolated 
nucleic acid molecules encoding AMLP1, AMLP1 
polypeptides, antibodies, transgenic cells and non-human 

30 organisms, and diagnostic, therapeutic, and 
investigational methods of using the same. 

BACKGROUND OF THE INVENTION 

35 The cytoskeleton plays a number of crucial 

roles in the life of a cell, from determining its shape 
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and polarity to directing its movements in response to 
external stimuli. The capacity for movement is a 
characteristic of virtually all animal cells and is 
pivotal for a variety of cellular activities such as T- 
5 cell mediated immune responses, developmental patterning 
during embryogenesis and tissue renewal processes as 
typified by wound healing. The cytoskeleton also 
participates in other vital cellular activities that 
include exocytosis, endocytosis, and membrane -bound 
10 vesicle trafficking within the cytoplasm. Evidence is 
emerging to suggest that the cytoskeleton may also play 
an important role as a platform or scaffold for 
assembling components of cellular signaling and response 
pathways . 

15 Of these functions, the capacity of a cell to 

respond to external signals via movement is the one role 
with greatest potential impact on human cancer, as 
uncontrolled migration of cells is a hallmark of tumor 
invasiveness and is a precursor to tumor metastasis. For 

20 this reason, considerable effort has been given to 

identifying the components of the cell's signaling and 
response pathways that lead to changes in the actin 
cytoskeleton. Some of the components of these pathways 
that have been shown to be important for signaling 

25 changes in the actin cytoskeleton include members of the 
Rho family of small GTPases and their activators, 
inhibitors, and downstream targets. 

Members of the Rho family of small GTPases 
(including Rho, Rac, and Cdc42) play central roles in the 

30 transduction of extracellular signals from the cell 
membrane to downstream effector molecules in the 
cytoplasm and nucleus and are highly conserved among 
eukaryotes. Reviewed by Takai Y. et al, Physiol. Rev. 
18:153-208 (2001); Hall A., Science 279:509-514 (1998). 

35 This class of signaling molecules primarily affects 
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activities associated with the actin cytoskeleton, but 
can also influence gene expression. 

Small GTPases of the Rho family exist in two 
states: an active GTP-bound form that is preferentially 
5 associated with membrane -bound structures and an inactive 
GDP-bound form that is largely found in the cytoplasm. 
Reviewed by Bishop A.L. and Hall A. , Biochem. J. 348:241- 
255 (2000) . The active form of Rho is capable of 
interacting with a diverse set of effector molecules that 

10 includes protein kinases and adaptor molecules that 

specifically result in changes in the actin cytoskeleton 
and/or activate kinase cascades that lead to changes in 
gene expression. Hydrolysis of GTP by members of the Rho 
family of GTPases converts the proteins into inactive 

15 molecules that no longer bind to their target effector 
proteins . 

Although all members of the Rho family of 
GTPases exert effects on the actin cytoskeleton, the 
outcomes of activation of individual family members are 

20 variable. For instance, Rho activation results primarily 
in the formation of stress fibers and has been implicated 
in the processes of cell adhesion, determination of cell 
polarity, and cell migration in epithelia. Assoian R.K. 
and Zhu X., Curr. Opin. Cell Biol. 9:93 (1997); Braga 

25 V.M.M. et al, J. Cell Biol. 137:1421 (1997); Schmitz A. A. 
et al., Exp. Cell Res. 261:1-12 (2000). By contrast, Rac 
activation leads to the formation of lamellipodia or 
membrane ruffles (Ridley et al., Cell 70:401-410 (1992)) 
and appears to be associated with directed cell migration 

30 and axonal guidance (Van Aelst and D' Souza-Schorey, Genes 
Dev. 11:2295-2322 (1997)). It is therefore not 
surprising that overexpression of Rho family members has 
been associated with cell transformation and tumors in 
human patients (Ridley A.J. Int. J. Biochem. Cell Biol. 

35 29:1225-12259 (1997); Aznar S. and Lacal. J.C., Cancer 
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Lett. 165:1-10 (2001)) and are likely to have an impact 
on tumor invasiveness and metastasis. 

Proteins that interact directly with the actin 
cytoskeleton are likely to play important roles in 
5 signaling pathways upstream of small GTPases as well as 
in their downstream response pathways. Therefore it is 
of great value to identify cytoskeleton-associated 
proteins. Angiomotin is a recently identified protein 
that is expressed at high levels in endothelial cells of 

10 blood vessels and co-localizes with actin filaments at 

the leading edges of migrating cells (Troyanovsky et al . , 
J. Cell Biol. 152:1247-1254, (2001)). In addition, it 
has been found that angiomotin can bind to angiostatin, a 
known inhibitor of angiogenesis . This interaction was 

15 found to inhibit endothelial cell migration in vitro, 
suggesting that angiomotin plays an important role in 
controlling the migration of endothelial cells. 

Recent reports suggest that at least one-third, 
and likely a higher percentage, of human genes are 

20 alternatively spliced. Hanke et al . , Trends Genet. 
15(1): 389 - 390 (1999); Mironov et al . , Genome Res. 
9:1288-93 (1999); Brett et al . , FEBS Lett. 474(l):83-6 

(2000) . Alternative splicing has been proposed to 
account for at least part of the difference between the 

25 number of genes recently called from the completed human 
genome draft sequence - 30,000 to 40,000 (Genome 
International Sequencing Consortium, Nature 409:860-921 

(2001) ) - and earlier predictions of human gene number 
that routinely ranged as high as 120,000. Liang et al . , 

30 Nature Genet. 25 (2) :239-240 (2000). With the Drosophila 
homolog of one human gene reported to have 38,000 
potential alternatively spliced variants, Schmucker et 
al., Cell 101:671 (2000), it now appears that alternative 
splicing may permit the relatively small number of human 

35 coding regions to encode millions, perhaps tens of 



WO 03/037931 PCT/US02/35129 

millions/ of structurally distinct proteins and protein 
isoforms . 

Alternative splicing can introduce early stop 
codon and shortened open reading frame. For example, 
5 several alternatively spliced human growth hormone 
receptor transcripts have been identified and are 
associated with the introduction of early stop codon. It 
was found that one of the shorter isoform, although 
inactive by itself, could regulate the biological 

10 activity of the growth hormone receptor by forming 

heterodimers with the full-length counterpart and inhibit 
the function of the full-length receptor. Ross et al, 
Molecular Endocrinology 11:265-273 (1997). 

Given a role for angiomotin as an adaptor 

15 protein that interacts with both angiostatin and 

components of the actin cytoskeleton and its demonstrated 
function as a mediator of angiostatin ' s anti-angiogenesis 
activity, there is a need to identify and to characterize 
additional human genes with structural and functional 

20 similarity to angiomotin. 

SUMMARY OF THE INVENTION 

The present invention solves these and other 
25 needs in the art by. providing isolated nucleic acids that 
encode human angiomotin- like protein 1, including two 
isoforms (AMLPla and AMLPlb) , and fragments thereof. 

In other aspects, the invention provides 
vectors for propagating and expressing the nucleic acids 
30 of the present invention, host cells comprising the 
nucleic acids and vectors of the present invention, 
proteins, protein fragments, and protein fusions of the 
human AMLP1, and antibodies thereto. 
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The invention further provides pharmaceutical 
formulations of the nucleic acids, proteins, and 
antibodies of the present invention. 

In other aspects, the invention provides 
5 transgenic cells and non-human organisms comprising human 
AMLP1 nucleic acids, and transgenic cells and non-human 
organisms with targeted disruption of the endogenous 
orthologue of the human AMLP1. 

The invention additionally provides diagnostic, 
10 investigational, and therapeutic methods based on the 
human AMLP1 nucleic acids, proteins, and antibodies of 
the present invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 

15 

The above and other objects and advantages of 
the present invention will be apparent upon consideration 
of the following detailed description taken in 
conjunction with the accompanying drawings, in which like 
20 characters refer to like parts throughout, and in which: 

FIG. 1 (A) schematizes the protein domain 
structure of human AMLPla and AMLPlb, FIG. 1 (B) shows 
the alignment of the myosin-tail motif of AMLPla with 
that of other proteins; 
25 FIG. 2 is a map showing the genomic structure 

of human AMLP1 encoded at chromosome llq21; 

FIG. 3 presents the nucleotide and predicted 
amino acid sequences of human AMLPla; 

FIG. 4 presents the nucleotide and predicted 
30 amino acid sequences of human AMLPlb; and 

FIG. 5 presents the expression profile of AMLP1 
by RT-PCR analysis. 



35 
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Mining the sequence of the human genome for 
novel human genes, the present inventors have identified 
human AMLP1 (including two isoforms) , an angiomotin-like 
protein, mutations of which could lead to cancer. 
5 As schematized in FIG. 1, the newly isolated 

gene products share certain protein domains and an 
overall structural organization with human angiomotin. 
The shared structural features strongly imply that human 
AMLP1 plays a role similar to that of human angiomotin as 

10 an adaptor protein that interacts with both angiostatin- 
like protein and components of the aetin cytoskeleton and 
has anti-angiogenesis activity. 

Like human angiomotin, human AMLP1 contains a 
partial Myosin-tail domain. In AMLPla, the partial 

15 Myosin-tail motif ocurrs at amino acids 351-733 

(http://www.ncbi.nlm.gov/Structure/cdd/wrpsb.cgi) . In the 
shorter AMLPlb protein, the partial Myosin-tail motif 
ends at amino acids sequence position 562 (which is the 
last amino acid for AMLPlb) . The Myosin-tail motif is 

20 represented by the coiled-coil myosin heavy chain tail 
region. The coiled-coil is composed of the tail from two 
molecules of myosin. These can then assemble into the 
macromolecular thick filament. The coiled-coil region 
provides the structural backbone of the thick filament. 

25 Other signatures of the newly isolated AMLP1 

proteins were identified by searching the PROSITE 
database (http://www.expasy.ch/tools/scnpsitl.html) . For 
AMLPla, these signatures include four N-glycosylation 
sites (51 - 54, 57 - 60, 631 - 634 and 635 - 638), one 

30 cAMP- and cGMP-dependent protein kinase phosphorylation 
site (405 - 408), twelve protein kinase C phosphorylation 
sites, seventeen Casein kinase II phosphorylation sites, 
six tf-myristoylation sites (3 - 8, 194 - 199, 244 - 249, 
566 - 571, 743 - 748 and 784 - 789), as well as three 

35 tyrosine kinase phosphorylation sites (15 - 23, 453 - 459 
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and 659 - 666) . For AMLPlb, these signatures include two 
Itf-glycosylation sites (51 - 54, 57 - 60), one cAMP- and 
cGMP-dependent protein kinase phosphorylation site (406 - 
409), six protein kinase C phosphorylation sites, twelve 
5 Casein kinase II phosphorylation sites, three N- 

myristoylation sites (3 - 8, 195 - 200 and 245 - 250), as 
well as two tyrosine kinase phosphorylation sites (15 - 
23 and 454 - 460) . 



10 FIG. 2 shows the genomic organization of human 

AMLP1. 

At the top is shown the bacterial artificial 
chromosome (BAC) , with GenBank accession numbers 
(AP001152 .4) , that spans the human AMLP1 locus. 

15 As shown in FIG. 2, AMLPla encodes a longer 

open reading frames compared to AMLPlb, and a protein of 
869 amino acids, AMLPla is comprised of exons 1-12. 
Insertion of a 66 base pair exon in AMLPlb (between exons 
7 and 8 of AMLPla) leads to frame shift and a shortened 

20 ORF with a protein of 563 amino acids. The predicted 
molecular weights for AMLPla and AMLPlb, prior to any 
post-translational modifications, are 96.8 and 63.4 kD, 
respectively. 

As further discussed in the examples herein, 

25 expression of AMLP1 was assessed using RT-PCR. RT-PCR 

detected high level expression of AMLP1 in brain, liver, 
kidney, and adrenal gland. AMLP1 expression is also 
detected in the other tissues tested, notably prostate, 
testis, lung, placenta, skeletal muscle, heart, bone 

30 marrow as well as colon tumor. 

As more fully described below, the present 
invention provides isolated nucleic acids that encode 
human AMLP1 and fragments thereof. The invention further 
provides vectors for propagation and expression of the 

35 nucleic acids of the present invention, host cells 
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comprising the nucleic acids and vectors of the present 
invention, proteins, protein fragments, and protein 
fusions of the present invention, and antibodies specific 
for all or any one of the isoforms. The invention 
5 provides pharmaceutical formulations of the nucleic 

acids, proteins, and antibodies of the present invention. 

The invention further provides transgenic cells and non- 
human organisms comprising human AMLP1 nucleic acids, and 
transgenic cells and non-human organisms with targeted 
10 disruption of the endogenous orthologue of the human 

AMLP1. The invention additionally provides diagnostic, 
investigational, and therapeutic methods based on the 
human AMLP1 nucleic acids, proteins, and antibodies of 
the present invention. 

15 

DEFINITIONS 



Unless defined otherwise, all technical and 
scientific terms used herein have the meaning commonly 

20 understood by one of ordinary skill in the art to which 
this invention belongs. 

As used herein, "nucleic acid" (synonymously, 
"polynucleotide") includes polynucleotides having natural 
nucleotides in native 5 ! -3' phosphodiester linkage - 

25 e.g., DNA or RNA — as well as polynucleotides that have 
nonnatural nucleotide analogues, nonnative 
internucleoside bonds, or both, so long as the nonnatural 
polynucleotide is capable of sequence-discriminating 
basepairing under experimentally desired conditions. 

30 Unless otherwise specified, the term "nucleic acid" 
includes any topological conformation; the term thus 
explicitly comprehends single -stranded, double -stranded, 
partially duplexed, triplexed, hairpinned, circular, and 
padlocked conformations. 
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As used herein, an "isolated nucleic acid" is a 

nucleic acid molecule that exists in a physical form that 
is nonidentical to any nucleic acid molecule of identical 
sequence as found in nature; "isolated" does not require, 
5 although it does not prohibit, that the nucleic acid so 
described has itself been physically removed from its 
native environment. 

For example, a nucleic acid can be said to be 
"isolated" when it includes nucleotides and/or 

10 internucleoside bonds not found in nature. When instead 
composed of natural nucleosides in phosphodiester 
linkage, a nucleic acid can be said to be "isolated" when 
it exists at a purity not found in nature, where purity 
can be adjudged with respect to the presence of nucleic 

15 acids of other sequence, with respect to the presence of 
proteins, with respect to the presence of lipids, or with 
respect the presence of any other component of a 
biological cell, or when the nucleic acid lacks sequence 
that flanks an otherwise identical sequence in an 

20 organism's genome, or when the nucleic acid possesses 
sequence not identically present in nature. 

As so defined, "isolated nucleic acid" includes 
nucleic acids integrated into a host cell chromosome at a 
heterologous site, recombinant fusions of a native 

25 fragment to a heterologous sequence, recombinant vectors 
present as episomes or as integrated into a host cell 
chromosome . 

As used herein, an isolated nucleic acid 
"encodes" a reference polypeptide when at least a portion 

30 of the nucleic acid, or its complement, can be directly 
translated to provide the amino acid sequence of the 
reference polypeptide, or when the isolated nucleic acid 
can be used, alone or as part of an expression vector, to 
express the reference polypeptide in vitro, in a 

35 prokaryotic host cell, or in a eukaryotic host cell. 

10 
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As used herein, the term "exon" refers to a 
nucleic acid sequence found in genomic DNA that is 
bioinformatically predicted and/or experimentally 
confirmed to contribute contiguous sequence to a mature 
5 mRNA transcript . 

As used herein, the phrase "open reading frame" 
and the equivalent acronym " ORF " refer to that portion of 
a transcript -derived nucleic acid that can be translated 
in its entirety into a sequence of contiguous amino 
10 acids. As so defined, an ORF has length, measured in 
nucleotides, exactly divisible by 3 . As so defined, an 
ORF need not encode the entirety of a natural protein. 

As used herein, the phrase "ORF-encoded 
peptide" refers to the predicted or actual translation of 
15 an ORF. 

As used herein, the phrase "degenerate variant" 

of a reference nucleic acid sequence intends all nucleic 
acid sequences that can be directly translated, using the 
standard genetic code, to provide an amino acid sequence 

20 identical to that translated from the reference nucleic 
acid sequence. 

As used herein, the term "microarray" and the 
equivalent phrase "nucleic acid microarray" refer to a 
substrate -bound collection of plural nucleic acids, 

25 hybridization to each of the plurality of bound nucleic 
acids being separately detectable. The substrate can be 
solid or porous, planar or non-planar, unitary or 
distributed. 

As so defined, the term "microarray" and phrase 
30 "nucleic acid microarray" include all the devices so 
called in Schena (ed.), DNA Microarrays : A Practical 
Approach (Practical Approach Series ) , Oxford University 
Press (1999) (ISBN: 0199637768); Nature Genet. 
21(1) (suppl) :1 - 60 (1999); and Schena (ed.), Microarray 
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Biochip: Tools and Technology , Eaton Publishing 
Company /BioTechniques Books Division (2000) (ISBN: 
1881299376), the disclosures of which are incorporated 
herein by reference in their entireties. 
5 As so defined, the term "microarray" and phrase 

"nucleic acid microarray" also include substrate-bound 
collections of plural nucleic acids in which the 
plurality of nucleic acids are distributably disposed on 
a plurality of beads, rather than on a unitary planar 

10 substrate, as is described, inter alia, in Brenner et 
al., Proc. Natl. Acad. Sci. USA 97 (4) : 166501670 (2000), 
the disclosure of which is incorporated herein by 
reference in its entirety; in such case, the term 
"microarray" and phrase "nucleic acid microarray" refer 

15 to the plurality of beads in aggregate. 

As used herein with respect to solution phase 
hybridization, the term "probe", or equivalently , 
"nucleic acid probe" or "hybridization probe", refers to 
an isolated nucleic acid of known sequence that is, or is 

20 intended to be, detectably labeled. As used herein with 
respect to a nucleic acid microarray, the term "probe" 
(or equivalently "nucleic acid probe" or "hybridization 
probe") refers to the isolated nucleic acid that is, or 
is intended to be, bound to the substrate. In either 

25 such context, the term "target" refers to nucleic acid 
intended to be bound to probe by sequence 
complementarity . 

As used herein, the expression "probe 
comprising SEQ ID NO:X", and variants thereof, intends a 

30 nucleic acid probe, at least a portion of which probe has 
either (i) the sequence directly as given in the 
referenced SEQ ID NO:X, or (ii) a sequence complementary 
to the sequence as given in the referenced SEQ ID NO:X, 
the choice as between sequence directly as given and 
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complement thereof dictated by the requirement that the 
probe be complementary to the desired target . 

As used herein, the phrases "expression of a 
probe" and "expression of an isolated nucleic acid" and 
5 their linguistic equivalents intend that the probe or, 
(respectively, the isolated nucleic acid) , or a probe 
(or, respectively, isolated nucleic acid) complementary 
in sequence thereto, can hybridize detectably under high 
stringency conditions to a sample of nucleic acids that 

10 derive from mRNA transcripts from a given source . For 

example, and by way of illustration only, expression of a 
probe in "liver" means that the probe can hybridize 
detectably under high stringency conditions to a sample 
of nucleic acids that derive from mRNA obtained from 

15 liver. 

As used herein, "a single exon probe" comprises 
at least part of an exon ("reference exon") and can 
hybridize detectably under high stringency conditions to 
transcript-derived nucleic acids that include the 

20 reference exon. The single exon probe will not, however, 
hybridize detectably under high stringency conditions to 
nucleic acids that lack the reference exon and that 
consist of one or more exons that are found adjacent to 
the reference exon in the genome. 

25 For purposes herein, "high stringency 

conditions" are defined for solution phase hybridization 
as aqueous hybridization (i.e., free of formamide) in 6X 
SSC (where 20X SSC contains 3.0 M NaCl and 0.3 M sodium 
citrate) , 1% SDS at 65°C for at least 8 hours, followed 

30 by one or more washes in 0 . 2X SSC, 0.1% SDS at 65°C. 

"Moderate stringency conditions" are defined for solution 
phase hybridization as aqueous hybridization (i.e., free 
of formamide) in 6X SSC, 1% SDS at 65°C for at least 8 
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hours, followed by one or more washes in 2x SSC, 0.1% SDS 
at room temperature. 

For microarray-based hybridization, standard 
"high stringency conditions" are defined as hybridization 
5 in 50% formamide, 5X SSC, 0.2 \ig/iil poly(dA), 0.2 ^g/^1 
human cotl DNA, and 0.5% SDS, in a humid oven at 42°C 
overnight, followed by successive washes of the 
microarray in IX SSC, 0.2% SDS at 55°C for 5 minutes, and 
then 0.1X SSC, 0.2% SDS, at 55°C for 20 minutes. For 

10 microarray-based hybridization, "moderate stringency 
conditions", suitable for cross -hybridization to mRNA 
encoding structurally- and functionally-related proteins, 
are defined to be the same as those for high stringency 
conditions but with reduction in temperature for 

15 hybridization and washing to room temperature 
(approximately 25°C) . 

As used herein, the terms "protein", 
"polypeptide", and "peptide" are used interchangeably to 
refer to a naturally-occurring or synthetic polymer of 

20 amino acid monomers (residues) , irrespective of length, 
where amino acid monomer here includes naturally- 
occurring amino acids, naturally-occurring amino acid 
structural variants, and synthetic non-naturally . 
occurring analogs that are capable of participating in 

25 peptide bonds. The terms "protein", "polypeptide", and 
"peptide" explicitly permits of post-translational and 
post-synthetic modifications, such as glycosylation . 

The term "oligopeptide" herein denotes a 
protein, polypeptide, or peptide having 25 or fewer 

30 monomeric subunits. 

The phrases "isolated protein", "isolated 
polypeptide", "isolated peptide" and "isolated 
oligopeptide" refer to a protein (or respectively to a 
polypeptide, peptide, or oligopeptide) that is 

14 
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nonidentical to any protein molecule of identical amino 
acid sequence as found in nature; "isolated" does not 
require, although it does not prohibit, that the protein 
so described has itself been physically removed from its 
5 native environment. 

For example, a protein can be said to be 
"isolated" when it includes amino acid analogues or 
derivatives not found in nature, or includes linkages 
other than standard peptide bonds. 

10 When instead composed entirely of natural amino 

acids linked by peptide bonds, a protein can be said to 
be "isolated" when it exists at a purity not found in 
nature — where purity can be adjudged with respect to the 
presence of proteins of other sequence, with respect to 

15 the presence of non-protein compounds, such as nucleic 
acids, lipids, or other components of a biological cell, 
or when it exists in a composition not found in nature, 
such as in a host cell that does not naturally express 
that protein. 

20 A "purified protein" (equally, a purified 

polypeptide, peptide, or oligopeptide) is an isolated 
protein, as above described, present at a concentration 
of at least 95%, as measured on a weight basis with 
respect to total protein in a composition. A 

25 "substantially purified protein" (equally, a 

substantially purified polypeptide, peptide, or 
oligopeptide) is an isolated protein, as above described, 
present at a concentration of at least 70%, as measured 
on a weight basis with respect to total protein in a 

30 composition. 

As used herein, the phrase "protein isoforms" 
refers to a plurality of proteins having nonidentical 
primary amino acid sequence but that share amino acid 
sequence encoded by at least one common exon. 
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As used herein, the phrase "alternative 
splicing" and its linguistic equivalents includes all 
types of RNA processing that lead to expression of plural 
protein isoforms from a single gene; accordingly, the 
5 phrase "splice variant (s) " and its linguistic equivalents 
embraces mRNAs transcribed from a given gene that, 
however processed, collectively encode plural protein 
isoforms. For example, and by way of illustration only, 
splice variants can include exon insertions, exon 

10 extensions, exon truncations, exon deletions, 

alternatives in the 5' untranslated region ("5' UT") and 
alternatives in the 3' untranslated region ("3' UT"). 
Such 3' alternatives include, for example, differences in 
the site of RNA transcript cleavage and site of poly (A) 

15 addition. See, e.g., Gautheret et al. f Genome Res. 
8:524-530 (1998) . 

As used herein, "orthologues" are separate 
occurrences of the same gene in multiple species. The 
separate occurrences have similar, albeit nonidentical , 

20 amino acid sequences, the degree of sequence similarity 
depending, in part, upon the evolutionary distance of the 
species from a common ancestor having the same gene. 

As used herein, the term "paralogues" indicates 
separate occurrences of a gene in one species. The 

25 separate occurrences have similar, albeit nonidentical, 
amino acid sequences, the degree of sequence similarity 
depending, in part, upon the evolutionary distance from 
the gene duplication event giving rise to the separate 
occurrences . 

30 As used herein, the term "homologues" is 

generic to "orthologues" and "paralogues". 

As used herein, the term "antibody" refers to a 
polypeptide, at least a portion of which is encoded by at 
least one immunoglobulin gene, or fragment thereof, and 

16 
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that can bind specifically to a desired target molecule. 

The term includes naturally-occurring forms, as well as 
fragments and derivatives. 

Fragments within the scope of the term 
5 "antibody" include those produced by digestion with 
various proteases, those produced by chemical cleavage 
and/or chemical dissociation, and those produced 
recombinant ly, so long as the fragment remains capable of . 
specific binding to a target molecule. Among such 

10 fragments are Fab, Fab 1 , Fv, F(ab)' 2 , and single chain Fv 
(scFv) fragments. 

Derivatives within the scope of the term 
include antibodies (or fragments thereof) that have been 
modified in sequence, but remain capable of specific 

15 binding to a target molecule, including: interspecies 
chimeric and humanized antibodies; antibody fusions; 
heteromeric antibody complexes and antibody fusions, such 
as diabodies (bispecif ic antibodies) , single-chain 
diabodies, and intrabodies (see, e.g., Marasco (ed.), 

20 Intracellular Antibodies: Research and Disease 

Applications , Springer-Verlag New York, Inc. (1998) 
(ISBN: 3540641513), the disclosure of which is 
incorporated herein by reference in its entirety) . 

As used herein, antibodies can be produced by 

25 any known technique, including harvest from cell culture 
of native B lymphocytes, harvest from culture of 
hybridomas, recombinant expression systems, and phage 
display. 

As used herein, "antigen" refers to a ligand 
30 that can be bound by an antibody; an antigen need not 

itself be immunogenic. The portions of the antigen that 
make contact with the antibody are denominated 
"epitopes" . 

"Specific binding" refers to the ability of two 
35 molecular species concurrently present in a heterogeneous 
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(inhomogeneous) sample to bind to one another in . 
preference to binding to other molecular species in the 
sample. Typically, a specific binding interaction will 
discriminate over adventitious binding interactions in 
5 the reaction by at least two-fold, more typically by at 
least 10-fold, often at least 100-fold; when used to 
detect analyte, specific binding is sufficiently 
discriminatory when determinative of the presence of the 
analyte in a heterogeneous (inhomogeneous) sample. 
10 Typically, the affinity or avidity of a specific binding 
reaction is least about 10" 7 M, with specific binding 
reactions of greater specificity typically having 
affinity or avidity of at least 10" 8 M to at least about 
10' 9 M. 

15 As used herein, "molecular binding partners" — 

and equivalently, "specific binding partners" — refer to 
pairs of molecules, typically pairs of biomolecules , that 
exhibit specific binding. Nonlimiting examples are 
receptor and ligand, antibody and antigen, and biotin to 

20 any of avidin, streptavidin, neutrAvidin and captAvidin. 

The term "antisense", as used herein, refers to 
a nucleic acid molecule sufficiently complementary in 
sequence, and sufficiently long in that complementary 
sequence, as to hybridize under intracellular conditions 

25 to (i) a target mRNA transcript or (ii) the genomic DNA 
strand complementary to that transcribed to produce the 
target mRNA transcript . 

The term "portion", as used with respect to 
nucleic acids, proteins, and antibodies, is synonymous 

30 with "fragment" . 

NUCLEIC ACID MOLECULES 



35 



In a first aspect, the invention provides 
isolated nucleic acids that encode human AMLP1 , naturally 
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occurring allelic variants, variants having at least 65% 
sequence identity thereto, degenerate variants thereof, 
variants that encode human AMLP1 proteins having 
conservative or moderately conservative substitutions, 
5 cross-hybridizing nucleic acids, and fragments thereof. 

FIG. 3 and FOG. 4 presents the nucleotide 
sequence of the human AMLP1 cDNA clones, with predicted 
amino acid translation; the sequences are further 
presented in the Sequence Listing, incorporated herein by 

10 reference in its entirety, in SEQ ID NO: 1 (full length 
nucleotide sequence of human AMLPla cDNA) , SEQ ID NO: 3 
(full length amino acid coding sequence of human AMLPla) , 
SEQ ID NO: 4 (full length nucleotide sequence of human 
AMLPlb cDNA) and SEQ ID NO: 6 (full length amino acid 

15 coding sequence of human AMLPlb) . 

Unless otherwise indicated, each nucleotide 
sequence is set forth herein as a sequence of 
deoxyribonucleotides . It is intended, however, that the 
given sequence be interpreted as would be appropriate to 

20 the polynucleotide composition: for example, if the 
isolated nucleic acid is composed of RNA, the given 
sequence intends ribonucleotides, with uridine 
substituted for thymidine. 

Unless otherwise indicated, nucleotide 

25 sequences of the isolated nucleic acids of the present 
invention were determined by sequencing a DNA molecule 
that had resulted, directly or indirectly, from at least 
one enzymatic polymerization reaction (e.g., reverse 
transcription and/or polymerase chain reaction) using an 

30 automated sequencer (such as the MegaBACE™ 1000, 

Amersham Biosciences, Sunnyvale, CA, USA) , or by reliance 
upon such sequence or upon genomic sequence prior- 
accessioned into a public database. Unless otherwise 
indicated, all amino acid sequences of the polypeptides 

19 
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of the present invention were predicted by translation 
from the nucleic acid sequences so determined. 

As a consequence, any nucleic acid sequence 
presented herein may contain errors introduced by 
5 erroneous incorporation of nucleotides during 
polymerization, by erroneous base calling by the 
automated sequencer (although such sequencing errors have 
been minimized for the nucleic acids directly determined 
herein, unless otherwise indicated, by the sequencing of 
10 each of the complementary strands of a duplex DNA) , or by 
similar errors accessioned into the public database. Such 
errors can readily be identified and corrected by 
resequencing of the genomic locus using standard 
techniques . 

15 Single nucleotide polymorphisms (SNPs) occur 

frequently in eukaryotic genomes - more than 1.4 million 
SNPs have already identified in the human genome, 
International Human Genome Sequencing Consortium, Nature 
409:860 - 921 (2001) - and the sequence determined from 

20 one individual of a species may differ from other allelic 
forms present within the population. Additionally, small 
deletions and insertions, rather than single nucleotide 
polymorphisms, are not uncommon in the general 
population, and often do not alter the function of the 

25 protein. 

Accordingly, it is an aspect of the present 
invention to provide nucleic acids not only identical in 
sequence to those described with particularity herein, 
but also to provide isolated nucleic acids at least about 

30 65% identical in sequence to those described with 

particularity herein, typically at least about 70%, 75%, 
80%, 85%, or 90% identical in sequence to those described 
with particularity herein, usefully at least about 91%, 
92%, 93%, 94%, or 95% identical in sequence to those 

35 described with particularity herein, usefully at least 

20 
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about 96%, 97%, 98% , or 99% identical in sequence to 
those described with particularity herein, and, most 
conservatively, at least about 99.5%, 99.6%, 99.7%, 99.8% 
and 99.9% identical in sequence to those described with 
5 particularity herein. These sequence variants can be 
naturally occurring or can result from human 
intervention, as by random or directed mutagenesis. 

For purposes herein, percent identity of two 
nucleic acid sequences is determined using the procedure 
10 of Tatiana et al . , "Blast 2 sequences - a new tool for 
comparing protein and nucleotide sequences", FEMS 
Microbiol Lett. 174:247-250 (1999), which procedure is 
effectuated by the computer program BLAST 2 SEQUENCES, 
available online at 

15 

http : //www . ncbi . nlm . nih . gov/blast /bl2seq/bl2 . html . 

To assess percent identity of nucleic acids, the BLASTN 
module of BLAST 2 SEQUENCES is used with default values 
20 of (i) reward for a match: 1; (ii) penalty for a 
mismatch: -2; (iii) open gap 5 and extension gap 2 
penalties; (iv) gap X_dropoff 50 expect 10 word size 11 
filter, and both sequences are entered in their 
entireties . 

25 As is well known, the genetic code is 

degenerate, with each amino acid except methionine 
translated from a plurality of codons, thus permitting a 
plurality of nucleic acids of disparate sequence to 
encode the identical protein. As is also well known, 

30 codon choice for optimal expression varies from species 
to species. The isolated nucleic acids of the present 
invention being useful for expression of human AMLP1 
proteins and protein fragments, it is, therefore, another 
aspect of the present invention to provide isolated 

35 nucleic acids that encode human AMLP1 proteins and 

21 
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portions thereof not only identical in sequence to those 
described with particularity herein, but degenerate 
variants thereof as well. 

As is also well known, amino acid substitutions 
5 occur frequently among natural allelic variants, with 
conservative substitutions often occasioning only de 
minimis change in protein function. 

Accordingly, it is an aspect of the present 
invention to provide nucleic acids not only identical in 

10 sequence to those described with particularity herein, 
but also to provide isolated nucleic acids that encode 
human AMLP1, and portions thereof, having conservative 
amino acid substitutions, and also to provide isolated 
nucleic acids that encode human AMLP1, and portions 

15 thereof, having moderately conservative amino acid 
substitutions . 

Although there are a variety of metrics for 
calling conservative amino acid substitutions, based 
primarily on either observed changes among evolutionarily 

20 related proteins or on predicted chemical similarity, for 
purposes herein a conservative replacement is any change 
having a positive value in the PAM250 log-likelihood 
matrix reproduced herein below (see Gonnet et al . , 
Science 256 (5062) : 1443-5 (1992)): 

25 
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For purposes herein, a "moderately conservative" 
replacement is any change having a nonnegative value in 
the PAM250 log-likelihood matrix reproduced herein above. 
15 As is also well known in the art, relatedness 

of nucleic acids can also be characterized using a 
functional test, the ability of the two nucleic acids to 
base-pair to one another at defined hybridization 
stringencies . 

20 It is, therefore, another aspect of the 

invention to provide isolated nucleic acids not only 
identical in sequence to those described with 
particularity herein, but also to provide isolated 
nucleic acids ( "cross -hybridizing nucleic acids") that 

25 hybridize under high stringency conditions (as defined 
herein below) to all or to a portion of various of the 
isolated AMLP1 nucleic acids of the present invention 
("reference nucleic acids"), as well as cross-hybridizing 
nucleic acids that hybridize under moderate stringency 

30 conditions to all or to a portion of various of the 

isolated AMLP1 nucleic acids of the present invention. 

Such cross -hybridizing nucleic acids are 
useful, inter alia, as probes for, and to drive 
expression of, proteins related to the proteins of the 

35 present invention as alternative isoforms, homologues, 
paralogues, and orthologues. Particularly useful 
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orthologues are those from other primate species, such as 
chimpanzee, rhesus macaque, monkey, baboon, orangutan, 
and gorilla; from rodents, such as rats, mice, guinea 
pigs/ from lagomorphs, such as rabbits; and from domestic 
5 livestock, such as cow, pig, sheep, horse, goat and 
chicken . 

For purposes herein, high stringency conditions 
are defined as aqueous hybridization (i.e., free of 
formamide) in 6X SSC (where 20X SSC contains 3.0 M NaCl 

10 and 0.3 M sodium citrate), 1% SDS at 65°C for at least 8 
hours, followed by one or more washes in 0.2X SSC, 0.1% 
SDS at 65°C. For purposes herein, moderate stringency 
conditions are defined as aqueous hybridization (i.e., 
free of formamide) in 6X SSC, 1% SDS at 6 5°C for at least 

15 8 hours, followed by one or more washes in 2x SSC, 0.1% 
SDS at room temperature. 

The hybridizing portion of the reference 
nucleic acid is typically at least 15 nucleotides in 
length, often at least 17 nucleotides in length. Often, 

20 however, the hybridizing portion of the reference nucleic 
acid is at least 20 nucleotides in length, 25 nucleotides 
in length, and even 30 nucleotides, 35 nucleotides, 40 
nucleotides, and 50 nucleotides in length. Of course, 
cross-hybridizing nucleic acids that hybridize to a 

25 larger portion of the reference nucleic acid - for 

example, to a portion of at least 50 nt, at least 100 nt, 
at least 150 nt, 200 nt, 250 nt, 300 nt, 350 nt, 400 nt, 
450 nt, or 500 nt or more - or even to the entire length 
of the reference nucleic acid, are also useful. 

30 The hybridizing portion of the cross- 

hybridizing nucleic acid is at least 75% identical in 
sequence to at least a portion of the reference nucleic 
acid. Typically, the hybridizing portion of the cross- 
hybridizing nucleic acid is at least 80%, often at least 

35 85%, 86%, 87%, 88%, 89% or even at least 90% identical in 
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sequence to at least a portion of the reference nucleic 
acid. Often, the hybridizing portion of the cross- 
hybridizing nucleic acid will be at least 91%, 92%, 93%, 
94%, 95%, 96%, 97%, 98%, or 99% identical in sequence to 
5 at least a portion of the reference nucleic acid 
sequence. At times, the hybridizing portion of the 
cross-hybridizing nucleic acid will be at least 99.5% 
identical in sequence to at least a portion of the 
reference nucleic acid. 
10 The invention also provides fragments of 

various of the isolated nucleic acids of the present 
invention . 

By "fragments" of a reference nucleic acid is 
here intended isolated nucleic acids, however obtained, 

15 that have a nucleotide sequence identical to a portion of 
the reference nucleic acid sequence, which portion is at 
least 17 nucleotides and less than the entirety of the 
reference nucleic acid. As so defined, "fragments" need 
not be obtained by physical fragmentation of the 

20 reference nucleic acid, although such provenance is not 
thereby precluded. 

In theory, an oligonucleotide of 17 nucleotides 
is of sufficient length as to occur at random less 
frequently than once in the three gigabase human genome, 

25 and thus to provide a nucleic acid probe that can 

uniquely identify the reference sequence in a nucleic 
acid mixture of genomic complexity. As is well known, 
further specificity can be obtained by probing nucleic 
acid samples of subgenomic complexity, and/or by using 

30 plural fragments as short as 17 nucleotides in length 

collectively to prime amplification of nucleic acids, as, 
e.g., by polymerase chain reaction (PCR) . 

As further described herein below, nucleic acid 
fragments that encode at least 6 contiguous amino acids 

35 (i.e., fragments of 18 nucleotides or more) are useful in 
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directing the expression or the synthesis of peptides 
that have utility in mapping the epitopes of the protein 
encoded by the reference nucleic acid. See, e.g., Geysen 
et al . , "Use of peptide synthesis to probe viral antigens 
5 for epitopes to a resolution of a single amino acid, " 

Proc. Natl. Acad. Sci . USA 81:3998-4002 (1984); and U.S. 
Pat. Nos. 4,708,871 and 5,595,915, the disclosures of 
which are incorporated herein by reference in their 
entireties . 

10 As further described herein below, fragments 

that encode at least 8 contiguous amino acids (i.e., 
fragments of 24 nucleotides or more) are useful in 
directing the expression or the synthesis of peptides 
that have utility as immunogens. See, e.g., Lerner, 

15 "Tapping the immunological repertoire to produce 

antibodies of predetermined specificity," Nature 299:592- 
596 (1982); Shinnick et al . , "Synthetic peptide 
immunogens as vaccines," Annu. Rev. Microbiol. 37:425-46 
(1983); Sutcliffe et al . , "Antibodies that react with 

20 predetermined sites on proteins," Science 219:660-6 

(1983) , the disclosures of which are incorporated herein 
by reference in their entireties. 

The nucleic acid fragment of the present 
invention is thus at least 17 nucleotides in length, 

25 typically at least 18 nucleotides in length, and often at 
least 24 nucleotides in length. Often, the nucleic acid 
of the present invention is at least 25 nucleotides in 
length, and even 30 nucleotides, 35 nucleotides, 40 
nucleotides, or 45 nucleotides in length. Of course, 

30 larger fragments having at least 50 nt, at least 100 nt, 
at least 150 nt, 200 nt, 250 nt, 300 nt, 350 nt, 400 nt, 
450 nt, or 500 nt or more are also useful, and at times 
preferred. 

Having been based upon the mining of genomic 
35 sequence, rather than upon surveillance of expressed 

26 
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message, the present invention further provides isolated 
genome -derived nucleic acids that include portions of the 
AMLP1 gene. 

The invention particularly provides genome- 
5 derived single exon probes. 

As further described in commonly owned and 
copending U.S. patent application serial nos . 09/864,761, 
filed May 23, 2001; 09/774,203, filed January 29, 2001; 
and 09/632,366, filed August 3, 2000, the disclosures of 

10 which are incorporated herein by reference in their 

entireties, "a single exon probe" comprises at least part 
of an exon ("reference exon") and can hybridize 
detectably under high stringency conditions to 
transcript -derived nucleic acids that include the 

15 reference exon. The single exon probe will not, however, 
hybridize detectably under high stringency conditions to 
nucleic acids that lack the reference exon and instead 
consist of one or more exons that are found adjacent to 
the reference exon in the genome . 

20 Genome -derived single exon probes typically 

further comprise, contiguous to a first end of the exon 
portion, a first intronic and/or intergenic sequence that 
is identically contiguous to the exon in the genome. 
Often, the genome -derived single exon probe further 

25 comprises, contiguous to a second end of the exonic 
portion, a second intronic and/or intergenic sequence 
that is identically contiguous to the exon in the genome. 

The minimum length of genome -derived single 
exon probes is defined by the requirement that the exonic 

30 portion be of sufficient length to hybridize under high 
stringency conditions to transcript-derived nucleic 
acids. Accordingly, the exon portion is at least 17 
nucleotides, typically at least 18 nucleotides, 20 
nucleotides, 24 nucleotides, 25 nucleotides or even 30, 

35 35, 40, 45, or 50 nucleotides in length, and can usefully 

27 
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include the entirety of the exon, up to 100 nt, 150 nt, 
200 nt, 250 nt, 300 nt, 350 nt, 400 nt or even 500 nt or 
more in length. 

The maximum length of genome -derived single 
5 exon probes is defined by the requirement that the probes 
contain portions of no more than one exon, that is, be 
unable to hybridize detectably under high stringency 
conditions to nucleic acids that lack the reference exon 
but include one or more exons that are found adjacent to 

10 the reference exon the genome. 

Given variable spacing of exons through 
eukaryotic genomes, the maximum length of single exon 
probes of the present invention is typically no more than 
25 kb, often no more than 20 kb, 15 kb, 10 kb or 7.5 kb, 

15 or even no more than 5 kb, 4 kb, 3 kb, or even no more 
than about 2.5 kb in length. 

The genome -derived single exon probes of the 
present invention can usefully include at least a first 
terminal priming sequence not found in contiguity with 

20 the rest of the probe sequence in the genome, and often 
will contain a second terminal priming sequence not found 
in contiguity with the rest of the probe sequence in the 
genome . 

The present invention also provides isolated 
25 genome -derived nucleic acids that include nucleic acid 

sequence elements that control transcription of the AMLP1 
gene. 

With a complete draft of the human genome now 
available, genomic sequences that are within the vicinity 
30 of the AMLP1 coding region (and that are additional to 

those described with particularity herein) can readily be 
obtained by PCR amplification. 

The isolated nucleic acids of the present 
invention can be composed of natural nucleotides in 
35 native 5' -3' phosphodiester internucleoside linkage - 
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e.g., DNA or RNA - or can contain any or all of 
nonnatural nucleotide analogues, nonnative 
internucleoside bonds, or post-synthesis modifications, 
either throughout the length of the nucleic acid or 
5 localized to one or more portions thereof. 

As is well known in the art, when the isolated 
nucleic acid is used as a hybridization probe, the range 
of such nonnatural analogues, nonnative internucleoside 
bonds, or post -synthesis modifications will be limited to 

10 those that permit sequence-discriminating basepairing of 
the resulting nucleic acid. When used to direct 
expression or RNA or protein in vitro or in vivo, the 
range of such nonnatural analogues, nonnative 
internucleoside bonds, or post-synthesis modifications 

15 will be limited to those that permit the nucleic acid to 
function properly as a polymerization substrate. When 
the isolated nucleic acid is used as a therapeutic agent, 
the range of such changes will be limited to those that 
do not confer toxicity upon the isolated nucleic acid. 

20 For example, when desired to be used as probes, 

the isolated nucleic acids of the present invention can 
usefully include nucleotide analogues that incorporate 
labels that are directly detectable, such as radiolabels 
or f luorophores, or nucleotide analogues that incorporate 

25 labels that can be visualized in a subsequent reaction, 
such as biotin or various haptens. 

Common radiolabeled analogues include those 
labeled with 33 P, 32 P, and 35 S, such as a- 32 P-dATP, a- 32 P- 
dCTP, a- 32 P-dGTP, a- 32 P-dTTP, a- 32 P-3 ' dATP, a- 32 P-ATP, a- 32 P- 

30 CTP, a- 32 P-GTP, a- 32 P-UTP, a- 35 S-dATP, y- 35 S-GTP, y- 33 P-dATP, 
and the like. 

Commercially available fluorescent nucleotide 
analogues readily incorporated into the nucleic acids of 
the present invention include Cy3-dCTP, Cy3-dUTP, Cy5- 
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dCTP, Cy3-dUTP (Amersham Pharmacia Biotech, Piscataway, 
New Jersey, USA), f luorescein-12-dUTP, 

tetramethylrhodamine-6-dUTP, Texas Red®-5-dUTP, Cascade 
Blue®-7-dUTP, BODIPY® FL-14-dUTP, BODIPY® TMR-14-dUTP, 
5 BODIPY® TR-14-dUTP, Rhodamine Green™-5-dUTP, Oregon 
Green® 488-5-dUTP, Texas Red®-12-dUTP, BODIPY® 
630/650-14-dUTP, BODIPY® 650/665-14-dUTP, Alexa Fluor® 
488-5-dUTP, Alexa Fluor® 532-5-dUTP, Alexa Fluor® 
568-5-dUTP, Alexa Fluor® 594-5-dUTP, Alexa Fluor® 

10 546-14-dUTP, fluorescein- 12 -UTP, 

tetramethylrhodamine-6-UTP, Texas Red®-5-UTP, Cascade 
Blue®-7-UTP, BODIPY® FL-14-UTP, BODIPY® TMR-14-UTP, 
BODIPY® TR-14-UTP, Rhodamine Green™-5-UTP, Alexa Fluor® 
488-5-UTP, Alexa Fluor® 546-14-UTP (Molecular Probes, 

15 Inc. Eugene, OR, USA) . 

Protocols are available for custom synthesis of 
nucleotides having other f luorophores . Henegariu et al . , 
"Custom Fluorescent-Nucleotide Synthesis as an 
Alternative Method for Nucleic Acid Labeling, " Nature 

20 Biotechnol. 18:345 - 348 (2000), the disclosure of which 
is incorporated herein by reference in its entirety. 

Haptens that are commonly conjugated to 
nucleotides for subsequent labeling include biotin 
(biotin-ll-dUTP, Molecular Probes, Inc., Eugene, OR, USA; 

25 biotin-21-UTP, biotin-21-dUTP, Clontech Laboratories, 
Inc., Palo Alto, CA, USA), digoxigenin (DIG-ll-dUTP, 
alkali labile, DIG-11-UTP, Roche Diagnostics Corp., 
Indianapolis, IN, USA), and dinitrophenyl 
(dinitrophenyl-ll-dUTP, Molecular Probes, Inc., Eugene, 

30 OR, USA) . 

As another example, when desired to be used for 
antisense inhibition of transcription or translation, the 
isolated nucleic acids of the present invention can 
usefully include altered, often nuclease-resistant , 
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internucleoside bonds. See Hartmann et al . (eds.), 
Manual of Antisense Methodology (Perspectives in 
Antisense Science) , Kluwer Law International (1999) 
(ISBN:079238539X) ; Stein et al . (eds.), Applied 
5 Antisense Oligonucleotide Technology , Wiley-Liss (cover 
(1998) (ISBN: 0471172790); Chadwick et al . (eds.), 
Oligonucleotides as Therapeutic Agents - Symposium No. 
209 , John Wiley & Son Ltd (1997) (ISBN: 0471972797), the 
disclosures of which are incorporated herein by reference 

10 in their entireties. Such altered internucloside bonds 
are often desired also when the isolated nucleic acid of 
the present invention is to be used for targeted gene 
correction, Gamper et al., Nucl. Acids Res. 
28 (21) :4332-4339 (2000), the disclosures of which are 

15 incorporated herein by reference in its entirety. 

Modified oligonucleotide backbones often 
preferred when the nucleic acid is to be used for 
antisense purposes are, for example, phosphorothioates, 
chiral phosphorothioates, phosphorodithioates, 

20 phosphotriesters, aminoalkylphosphotriesters , methyl and 
other alkyl phosphonates including 3 1 -alkylene 
phosphonates and chiral phosphonates, phosphinates, 
phosphoramidates including 3 ' -amino phosphoramidate and 
aminoalkylphosphoramidates , thionophosphoramidates , 

25 thionoalkylphosphonates , thionoalkylphosphotriesters , and 
boranophosphates having normal 3 1 -5 ' linkages, 2 1 -5 1 
linked analogs of these, and those having inverted 
polarity wherein the adjacent pairs of nucleoside units 
are linked 3"-5» to 5' -3' or 2' -5' to 5' -2'. 

30 Representative U.S. patents that teach the preparation of 
the above phosphorus -containing linkages include, but are 
not limited to, U.S. Pat. Nos. 3,687,808; 4,469,863; 
4,476,301; 5,023,243; 5,177,196; 5,188,897; 5,264,423; 
5,276,019; 5,278,302; 5,286,717; 5,321,131; 5,399,676; 

35 5,405,939; 5,453,496; 5,455,233; 5,466,677; 5,476,925; 
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5,519,126; 5,536,821; 5,541,306; 5,550,111; 5,563,253; 
5,571,799; 5,587,361; and 5,625,050, the disclosures of 
which are incorporated herein by reference in their 
entireties . 

5 Preferred modified oligonucleotide backbones 

for antisense use that do not include a phosphorus atom 
have backbones that are formed by short chain alkyl or 
cycloalkyl internucleoside linkages, mixed heteroatom and 
alkyl or cycloalkyl internucleoside linkages, or one or 

10 more short chain heteroatomic or heterocyclic 

internucleoside linkages. These include those having 
morpholino linkages (formed in part from the sugar 
portion of a nucleoside) ; siloxane backbones; sulfide, 
sulfoxide and sulfone backbones; formacetyl and 

15 thioformacetyl backbones; methylene formacetyl and 

thioformacetyl backbones; alkene containing backbones; 
sulfamate backbones; methyleneimino and 

methylenehydrazino backbones; sulfonate and sulfonamide 

backbones; amide backbones; and others having mixed N, O, 
20 S and CH 2 component parts. Representative U.S. patents 

that teach the preparation of the above backbones 

include, but are not limited to, U.S. Pat. Nos. 

5,034,506; 5,166,315; 5,185,444; 5,214,134; 5,216,141; 

5,235,033; 5,264,562; 5,264,564; 5,405,938; 5,434,257; 
25 5,466,677; 5,470,967; 5,489,677; 5,541,307; 5,561,225; 

5,596,086; 5,602,240; 5,610,289; 5,602,240; 5,608,046; 

5,610,289; 5,618,704; 5,623,070; 5,663,312; 5,633,360; 

5,677,437; and 5,677,439, the disclosures of which are 

incorporated herein by reference in their entireties. 
30 In other preferred oligonucleotide mimetics, 

both the sugar and the internucleoside linkage are 

replaced with novel groups, such as peptide nucleic acids 

(PNA) . 

In PNA compounds, the phosphodiester backbone 
35 of the nucleic acid is replaced with an amide-containing 
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backbone, in particular by repeating N- (2-aminoethyl) 
glycine units linked by amide bonds. Nucleobases are 
bound directly or indirectly to aza nitrogen atoms of the 
amide portion of the backbone, typically by methylene 
5 carbonyl linkages. 

The uncharged nature of the PNA backbone 
provides PNA/DNA and PNA/RNA duplexes with a higher 
thermal stability than is found in DNA/DNA and DNA/RNA 
duplexes, resulting from the lack of charge repulsion 

10 between the PNA and DNA or RNA strand. In general, the 
Tm of a PNA/DNA or PNA/RNA duplex is 1°C higher per base 
pair than the Tm of the corresponding DNA/DNA or DNA/RNA 
duplex (in 100 mM NaCl) . 

The neutral backbone also allows PNA to form 

15 stable DNA duplexes largely independent of salt 
concentration. At low ionic strength, PNA can be 
hybridized to a target sequence at temperatures that make 
DNA hybridization problematic or impossible. And unlike 
DNA/DNA duplex formation, PNA hybridization is possible 

20 in the absence of magnesium. Adjusting the ionic 

strength, therefore, is useful if competing DNA or RNA is 
present in the sample, or if the nucleic acid being 
probed contains a high level of secondary structure. 

PNA also demonstrates greater specificity in 

25 binding to complementary DNA. A PNA/DNA mismatch is more 
destabilizing than DNA/DNA mismatch. A single mismatch 
in mixed a PNA/DNA 15-mer lowers the Tm by 8-20°C (15°C 
on average) . In the corresponding DNA/DNA duplexes, a 
single mismatch lowers the Tm by 4-16°C (ll°C on 

30 average) . Because PNA probes can be significantly 

shorter than DNA probes, their specificity is greater. 

Additionally, nucleases and proteases do not 
recognize the PNA polyamide backbone with nucleobase 
sidechains. As a result, PNA oligomers are resistant to 
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degradation by enzymes, and the lifetime of these 
compounds is extended both in vivo and in vitro. In 
addition, PNA is stable over a wide pH range. 

Because its backbone is formed from amide 
5 bonds, PNA can be synthesized using a modified peptide 
synthesis protocol. PNA oligomers can be synthesized by 
both Fmoc and tBoc methods. Representative U.S. patents 
that teach the preparation of PNA compounds include, but 
are not limited to, U.S. Pat. Nos. 5,539,082; 5,714,331; 

10 and 5,719,262, each of which is herein incorporated by 
reference; automated PNA synthesis is readily achievable 
on commercial synthesizers (see, e.g., " PNA User's 
Guide," Rev. 2, February 1998, Perseptive Biosystems Part 
No. 60138, Applied Biosystems, Inc., Foster City, CA) . 

15 PNA chemistry and applications are reviewed, 

inter alia, in Ray et al . , FASEB J. 14 (9) : 1041-60 (2000); 
Nielsen et al . , Pharmacol Toxicol. 86(1) :3-7 (2000); 
Larsen et al . , Biochim Biophys Acta. 1489 ( 1) : 159-66 
(1999); Nielsen, Curr. Opin. Struct. Biol. 9(3):353-7 

20 (1999), and Nielsen, Curr. Opin. Biotechnol . 10(l):71-5 
(1999) , the disclosures of which are incorporated herein 
by reference in their entireties. 

Differences from nucleic acid compositions 
found in nature - e.g., nonnative bases, altered 

25 internucleoside linkages, post-synthesis modification - 
can be present throughout the length of the nucleic acid 
or can, instead, usefully be localized to discrete 
portions thereof. As an example of the latter, chimeric 
nucleic acids can be synthesized that have discrete DNA 

30 and RNA domains and demonstrated utility for targeted 
gene repair, as further described in U.S. Pat. Nos. 
5,760,012 and 5,731,181, the disclosures of which are 
incorporated herein by reference in their entireties. As 
another example, chimeric nucleic acids comprising both 

35 DNA and PNA have been demonstrated to have utility in 
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modified PCR reactions. See Misra et al . , Biochem. 37: 
1917-1925 (1998); see also Finn et al . , Nucl . Acids Res, 
24: 3357-3363 (1996), incorporated herein by reference. 

Unless otherwise specified, nucleic acids of 
5 the present invention can include any topological 

conformation appropriate to the desired use; the term 
thus explicitly comprehends, among others, single- 
stranded , double - s t randed , t r iplexed , quadrupl exed , 
partially double-stranded, partially- triplexed, 

10 partially-quadruplexed, branched, hairpinned, circular, 
and padlocked conformations. Padlock conformations and 
their utilities are further described in Baner et al., 
Curr. Opin. Biotechnol. 12:11-15 (2001); Escude et al . , 
Proc. Natl. Acad. Sci. USA 14 ; 96 (19) : 10603-7 (1999); 

15 Nilsson et al., Science 265 (5181) : 2085-8 (1994), the 

disclosures of which are incorporated herein by reference 
in their entireties. Triplex and quadruplex 
conformations, and their utilities, are reviewed in 
Praseuth et al . , Biochim. Biophys. Acta. 1489 (1) : 181-206 

20 (1999); Fox, Curr. Med. Chem. 7(1): 17-37 (2000); 

Kochetkova et al., Methods Mol . Biol. 130:189-201 (2000); 
Chan et al., J. Mol. Med. 75(4):267-82 (1997), the 
disclosures of which are incorporated herein by reference 
in their entireties. 

25 The nucleic acids of the present invention can 

be detectably labeled. 

Commonly-used labels include radionuclides, 
such as 32 P, 33 P, 35 S, 3 H (and for NMR detection, 13 C and 
15 N) , haptens that can be detected by specific antibody 

30 or high affinity binding partner (such as avidin) , and 
f luorophores . 

As noted above, detectable labels can be 
incorporated by inclusion of labeled nucleotide analogues 
in the nucleic acid. Such analogues can be incorporated 

35 by enzymatic polymerization, such as by nick translation, 
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random priming, polymerase chain reaction (PCR) , terminal 
transferase tailing, and end- filling of overhangs, for 
DNA molecules, and in vitro transcription driven, e.g., 
from phage promoters, such as T7, T3, and SP6 , for RNA 
5 molecules. Commercial kits are readily available for 
each such labeling approach. 

Analogues can also be incorporated during 
automated solid phase chemical synthesis. 

As is well known, labels can also be 

10 incorporated after nucleic acid synthesis, with the 5' 
phosphate and 3 1 hydroxyl providing convenient sites for 
post-synthetic covalent attachment of detectable labels. 

Various other post -synthetic approaches permit 
internal labeling of nucleic acids. 

15 For example, fluorophores can be attached using 

a cisplatin reagent that reacts with the N7 of guanine 
residues (and, to a lesser extent, adenine bases) in DNA, 
RNA, and PNA to provide a stable coordination complex 
between the nucleic acid and fluorophore label (Universal 

20 Linkage System) (available from Molecular Probes, Inc., 
Eugene, OR, USA and Amersham Pharmacia Biotech, 
Piscataway, NJ, USA); see Alers et al . , Genes, 
Chromosomes & Cancer, Vol. 25, pp. 301 - 305 (1999); 
Jelsma et al . , J. NIH Res. 5:82 (1994); Van Belkum et 

25 al., BioTechniques 16:148-153 (1994), incorporated herein 
by reference. As another example, nucleic acids can be 
labeled using a disulf ide-containing linker (FastTag™ 
Reagent, Vector Laboratories, Inc., Burlingame, CA, USA) 
that is photo- or thermally coupled to the target nucleic 

30 acid using aryl azide chemistry; after reduction, a free 
thiol is available for coupling to a hapten, fluorophore, 
sugar, affinity ligand, or other marker. 

Multiple independent or interacting labels can 
be incorporated into the nucleic acids of the present 

35 invention. 
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For example, both a fluorophore and a moiety 
that in proximity thereto acts to quench fluorescence can 
be included to report specific hybridization through 
release of fluorescence quenching, Tyagi et al., Nature 
5 Biotechnol. 14: 303-308 (1996); Tyagi etal. t Nature 
Biotechnol. 16, 49-53 (1998); Sokol et al . , Proc. Natl. 
Acad. Sci. USA 95: 11538-11543 (1998); Kostrikis et al . , 
Science 279:1228-1229 (1998); Marras et al . , Genet. 
Anal. 14: 151-156 (1999); U.S. Pat. Nos . 5,846,726, 

10 5,925,517, 5925517, or to report exonucleotidic excision, 
U.S. Pat. No. 5,538,848; Holland et al . , Proc . Natl. 
Acad. Sci. USA 88:7276-7280 (1991); Heid et al . , Genome 
Res. 6(10):986-94 (1996); Kuimelis et al . , Nucleic Acids 
Symp Ser. (37) :255-6 (1997); U.S. Patent No. 5,723,591, 

15 the disclosures of which are incorporated herein by 
reference in their entireties. 

So labeled, the isolated nucleic acids of the 
present invention can be used as probes, as further 
described below. 

20 Nucleic acids of the present invention can also 

usefully be bound to a substrate. The substrate can 
porous or solid, planar or non-planar, unitary or 
distributed; the bond can be covalent or noncovalent. 
Bound to a substrate, nucleic acids of the present 

25 invention can be used as probes in their unlabeled state. 

For example, the nucleic acids of the present 
invention can usefully be bound to a porous substrate, 
commonly a membrane, typically comprising nitrocellulose, 
nylon, or positively-charged derivatized nylon; so 

30 attached, the nucleic acids of the present invention can 
be used to detect AMLP1 nucleic acids present within a 
labeled nucleic acid sample, either a sample of genomic 
nucleic acids or a sample of transcript -derived nucleic 
acids, e.g. by reverse dot blot. 
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The nucleic acids of the present invention can 
also usefully be bound to a solid substrate, such as 
glass, although other solid materials, such as amorphous 
silicon, crystalline silicon, or plastics, can also be 
5 used. Such plastics include polymethylacrylic, 
polyethylene , polypropylene , polyacrylate , 
polymethylmethacrylate , polyvinylchloride , 
polytetraf luoroethylene , polystyrene , polycarbonate , 
polyacetal , polysul f one , celluloseacetat e , 

10 cellulosenitrate, nitrocellulose, or mixtures thereof. 

Typically, the solid substrate will be 
rectangular, although other shapes, particularly disks 
and even spheres, present certain advantages. 
Particularly advantageous alternatives to glass slides as 

15 support substrates for array of nucleic acids are optical 
discs, as described in Demers, "Spatially Addressable 
Combinatorial Chemical Arrays in CD-ROM Format , " 
international patent publication WO 98/12559, 
incorporated herein by reference in its entirety. 

20 The nucleic acids of the present invention can 

be attached covalently to a surface of the support 
substrate or applied to a derivatized surface in a 
chaotropic agent that facilitates denaturation and 
adherence by presumed noncovalent interactions, or some 

25 combination thereof. 

The nucleic acids of the present invention can 
be bound to a substrate to which a plurality of other 
nucleic acids are concurrently bound, hybridization to 
each of the plurality of bound nucleic acids being 

30 separately detectable. At low density, e.g. on a porous 
membrane, these substrate -bound collections are typically 
denominated macroarrays; at higher density, typically on 
a solid support, such as glass, these substrate bound 
collections of plural nucleic acids are colloquially 

35 termed microarrays. As used herein, the term microarray 



WO 03/037931 PCT/US02/35129 

includes arrays of all densities. It is, therefore, 
another aspect of the invention to provide microarrays 
that include the nucleic acids of the present invention. 
The isolated nucleic acids of the present 
5 invention can be used as hybridization probes to detect, 
characterize, and quantify AMLP1 nucleic acids in, and 
isolate AMLP1 nucleic acids from, both genomic and 
transcript -derived nucleic acid samples. When free in 
solution, such probes are typically, but not invariably, 
10 detectably labeled; bound to a substrate, as in a 

microarray, such probes are typically, but not invariably 
unlabeled . 

For example, the isolated nucleic acids of the 
present invention can be used as probes to detect and 

15 characterize gross alterations in the AMLP1 genomic 

locus, such as deletions, insertions, translocations, and 
duplications of the AMLP1 genomic locus through 
fluorescence in situ hybridization (FISH) to chromosome 
spreads. See, e.g., Andreeff et al. (eds.), Introduction 

20 to Fluorescence In Situ Hybridization; Principles and 
Clinical Applications , John Wiley & Sons (1999) (ISBN: 
0471013455) , the disclosure of which is incorporated 
herein by reference in its entirety. The isolated 
nucleic acids of the present invention can be used as 

25 probes to assess smaller genomic alterations using, e.g., 
Southern blot detection of restriction fragment length 
polymorphisms. The isolated nucleic acids of the present 
invention can be used as probes to isolate genomic clones 
that include the nucleic acids of the present invention, 

30 which thereafter can be restriction mapped and sequenced 
to identify deletions, insertions, translocations, and 
substitutions (single nucleotide polymorphisms, SNPs) at 
the sequence level . 

The isolated nucleic acids of the present 

35 invention can also be used as probes to detect, 
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characterize, and quantify AMLPl nucleic acids in, and 
isolate AMLPl nucleic acids from, transcript -derived 
nucleic acid samples. 

For example, the isolated nucleic acids of the 
5 present invention can be used as hybridization probes to 
detect, characterize by length, and quantify AMLPl mRNA 
by northern blot of total or poly-A + - selected RNA 
samples. For example, the isolated nucleic acids of the 
present invention can be used as hybridization probes to 

10 detect, characterize by location, and quantify AMLPl 

message by in situ hybridization to tissue sections (see, 
e.g., Schwarchzacher et al . , In Situ Hybridization , 
Springer -Ver lag New York (2000) (ISBN: 0387915966), the 
disclosure of which is incorporated herein by reference 

15 in its entirety) . For example, the isolated nucleic 
acids of the present invention can be used as 
hybridization probes to measure the representation of 
AMLPl clones in a cDNA library. For example, the 
isolated nucleic acids of the present invention can be 

20 used as hybridization probes to isolate AMLPl nucleic 
acids from cDNA libraries, permitting sequence level 
characterization of AMLPl messages, including 
identification of deletions, insertions, truncations - 
including deletions, insertions, and truncations of exons 

25 in alternatively spliced forms — and single nucleotide 
polymorphisms . 

All of the aforementioned probe techniques are 
well within the skill in the art, and are described at 
greater length in standard texts such as Sambrook et al . , 

30 Molecular Cloning: A Laboratory Manual (3 rd ed.), Cold 

Spring Harbor Laboratory Press (2001) (ISBN: 0879695773); 
Ausubel et al. (eds.), Short Protocols in Molecular 
Biology: A Compendium of Methods from Current Protocols 
in Molecular Biology (4 th ed.), John Wiley & Sons, 1999 

35 (ISBN: 047132938X) ; and Walker et al . (eds.), The Nucleic 
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Acids Protocols Handbook , Humana Press (2000) (ISBN: 
0896034593), the disclosures of which are incorporated 
herein by reference in their entirety. 

As described in the Examples herein below, the 
5 nucleic acids of the present invention can also be used 
to detect and quantify AMLP1 nucleic acids in transcript - 
derived samples — that is, to measure expression of the 
AMLP1 gene — when included in a microarray. Measurement 
of AMLP1 expression has particular utility in diagnosis 

10 and treatment of cancer, as further described in the 
Examples herein below. 

As would be readily apparent to one of skill in 
the art, each AMLP1 nucleic acid probe - whether labeled, 
substrate-bound, or both — is thus currently available 

15 for use as a tool for measuring the level of AMLP1 

expression in each of the tissues in which expression has 
already been confirmed, notably brain, liver, kidney, and 
adrenal gland, as well as prostate, testis, lung, 
placenta, skeletal muscle, heart, bone marrow and colon 

20 tumor. The utility is specific to the probe: under high 
stringency conditions, the probe reports the level of 
expression of message specifically containing that 
portion of the human AMLP1 gene included within the 
probe . 

25 Measuring tools are well known in many arts, 

not just in molecular biology, and are known to possess 
credible, specific, and substantial utility. For 
example, U.S. Patent No. 6,016,191 describes and claims a 
tool for measuring characteristics of fluid flow in a 

30 hydrocarbon well; U.S. Patent No. 6,042,549 describes and 
claims a device for measuring exercise intensity; U.S. 
Patent No. 5,889,351 describes and claims a device for 
measuring viscosity and for measuring characteristics of 
a fluid; U.S. Patent No. 5,570,694 describes and claims a 

35 device for measuring blood pressure; U.S. Patent No. 
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5,930,143 describes and claims a device for measuring the 
dimensions of machine tools; U.S. Patent No. 5,279,044 
describes and claims a measuring device for determining 
an absolute position of a movable element; U.S. Patent 
5 No. 5,186,042 describes and claims a device for measuring 
action force of a wheel; and U.S. Patent No. 4,24 6,774 
describes and claims a device for measuring the draft of 
smoking articles such as cigarettes. 

As for tissues not yet demonstrated to express 
10 AMLP1, the AMLP1 nucleic acid probes of the present 

invention are currently available as tools for surveying 
such tissues to detect the presence of AMLP1 nucleic 
acids . 

Survey tools - i.e., tools for determining the 

15 presence and/or location of a desired object by search of 
an area — are well known in many arts, not just in 
molecular biology, and are known to possess credible, 
specific, and substantial utility. For example, U.S. 
Patent No. 6,046,800 describes and claims a device for 

20 surveying an area for objects that move; U.S. Patent No. 
6,025,201 describes and claims an apparatus for locating 
and discriminating platelets from non-platelet particles 
or cells on a cell -by-cell basis in a whole blood sample; 
U.S. Patent No. 5,990,689 describes and claims a device 

25 for detecting and locating anomalies in the 

electromagnetic protection of a system; U.S. Patent No. 
5,984,175 describes and claims a device for detecting and 
identifying wearable user identification units; U.S. 
Patent No. 3,980,986 ("Oil well survey tool"), describes 

30 and claims a tool for finding the position of a drill bit 
working at the bottom of a borehole. 

As noted above, the nucleic acid probes of the 
present invention are useful in constructing microarrays; 
the microarrays, in turn, are products of manufacture 
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that are useful for measuring and for surveying gene 
expression. 

When included on a microarray , each AMLP1 
nucleic acid probe makes the microarray specifically 
5 useful for detecting that portion of the AMLP1 gene 
included within the probe, thus imparting upon the 
microarray device the ability to detect a signal where, 
absent such probe, it would have reported no signal. 
This utility makes each individual probe on such 

10 microarray akin to an antenna, circuit, firmware or 
software element included in an electronic apparatus, 
where the antenna, circuit, firmware or software element 
imparts upon the apparatus the ability newly and 
additionally to detect signal in a portion of the radio- 

15 frequency spectrum where previously it could not; such 
devices are known to have specific, substantial, and 
credible utility. 

Changes in the level of expression need not be 
observed for the measurement of expression to have 

20 utility. 

For example, where gene expression analysis is 
used to assess toxicity of chemical agents on cells, the 
failure of the agent to change a gene's expression level 
is evidence that the drug likely does not affect the 

25 pathway of which the gene's expressed protein is a part. 
Analogously, where gene expression analysis is used to 
assess side effects of pharmacologic agents - whether in 
lead compound discovery or in subsequent screening of 
lead compound derivatives - the inability of the agent to 

30 alter a gene's expression level is evidence that the drug 
does not affect the pathway of which the gene's expressed 
protein is a part. 

WO 99/58720, incorporated herein by reference 
in its entirety, provides methods for quantifying the 

35 relatedness of a first and second gene expression profile 
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and for ordering the relatedness of a plurality of gene 
expression profiles, without regard to the identity or 
function of the genes whose expression is used in the 
calculation. 

5 Gene expression analysis, including gene 

expression analysis by microarray hybridization, is, of 
course, principally a laboratory-based art. Devices and 
apparatus used principally in laboratories to facilitate 
laboratory research are well-established to possess 

10 specific, substantial, and credible utility. For 

example, U.S. Patent No. 6,001,233 describes and claims a 
gel electrophoresis apparatus having a cam-activated 
clamp; for example, U.S. Patent No. 6,051,831 describes 
and claims a high mass detector for use in time-of -flight 

15 mass spectrometers; for example, U.S. Patent NO. 

5,824,269 describes and claims a flow cytometer— as is 
well known, few gel electrophoresis apparatuses, TOF-MS 
devices, or flow cytometers are sold for consumer use. 
Indeed, and in particular, nucleic acid 

20 microarrays, as devices intended for laboratory use in 
measuring gene expression, are well-established to have 
specific, substantial and credible utility. Thus, the 
microarrays of the present invention have at least the 
specific, substantial and credible utilities of the 

25 microarrays claimed as devices and articles of 
manufacture in the following U.S. patents, the 
disclosures of each of which is incorporated herein by 
reference: U.S. Patent Nos . 5,445,934 ("Array of 
oligonucleotides on a solid substrate 11 ); 5,744,305 

30 ("Arrays of materials attached to a substrate") ; and 
6,004,752 ("Solid support with attached molecules") . 

Genome-derived single exon probes and genome - 
derived single exon probe microarrays have the additional 
utility, inter alia, of permitting high- throughput 

35 detection of splice variants of the nucleic acids of the 
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present invention, as further described in copending and 
commonly owned U.S. Patent application no. 09/632,366, 
filed August 3, 2000, the disclosure of which is 
incorporated herein by reference in its entirety. 
5 The isolated nucleic acids of the present 

invention can also be used to prime synthesis of nucleic 
acid, for purpose of either analysis or isolation, using 
mRNA, cDNA, or genomic DNA as template. 

For use as primers, at least 17 contiguous 

10 nucleotides of the isolated nucleic acids of the present 
invention will be used. Often, at least 18, 19, or 20 
contiguous nucleotides of the nucleic acids of the 
present invention will be used, and on occasion at least 
20, 22, 24, or 25 contiguous nucleotides of the nucleic 

15 acids of the present invention will be used, and even 30 
nucleotides or more of the nucleic acids of the present 
invention can be used to prime specific synthesis. 

The nucleic acid primers of the present 
invention can be used, for example, to prime first strand 

20 cDNA synthesis on an mRNA template. 

Such primer extension can be done directly to 
analyze the message. Alternatively, synthesis on an mRNA 
template can be done to produce first strand cDNA. The 
first strand cDNA can thereafter be used, inter alia, 

25 directly as a single -stranded probe, as above-described, 
as a template for sequencing - permitting identification 
of alterations, including deletions, insertions, and 
substitutions, both normal allelic variants and mutations 
associated with abnormal phenotypes- or as a template, 

30 either for second strand cDNA synthesis (e.g., as an 
antecedent to insertion into a cloning or expression 
vector), or for amplification. 

The nucleic acid primers of the present 
invention can also be used, for example, to prime single 

35 base extension (SBE) for SNP detection (see, e.g., U.S. 
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Pat. No. 6,004,744, the disclosure of which is 
incorporated herein by reference in its entirety) . 

As another example, the nucleic acid primers of 
the present invention can be used to prime amplification 
5 of AMLP1 nucleic acids, using transcript -derived or 
genomic DNA as template. 

Primer-directed amplification methods are now 
well-established in the art. Methods for performing the 
polymerase chain reaction (PCR) are compiled, inter alia, 

10 in McPherson, PCR (Basics: From Background to Bench) , 
Springer Verlag (2000) (ISBN: 0387916008); Innis et al . 
(eds . ) , PCR Applications: Protocols for Functional 
Genomics , Academic Press (1999) (ISBN: 0123721857) ; 
Gelfand et al . (eds.), PCR Strategies , Academic Press 

15 (1998) (ISBN: 0123721822); Newton et al. r PCR , 

Springer-Verlag New York (1997) (ISBN: 0387915060); Burke 
(ed . ) , PCR: Essential Techniques , John Wiley & Son Ltd 
(1996) (ISBN: 047195697X) ; White (ed.), PCR Cloning 
Protocols : From Molecular Cloning to Genetic Engineering, 

20 Vol. 67, Humana Press (1996) (ISBN: 0896033430); 

McPherson et al . (eds.), PCR 2: A Practical Approach , 
Oxford University Press, Inc. (1995) (ISBN: 0199634254), 
the disclosures of which are incorporated herein by 
reference in their entireties. Methods for performing 

25 RT-PCR are collected, e.g., in Siebert et al. (eds.), 
Gene Cloning and Analysis by RT-PCR , Eaton Publishing 
Company/Bio Techniques Books Division, 1998 (ISBN: 
1881299147); Siebert (ed.), PCR Technique : RT-PCR , Eaton 
Publishing Company/BioTechniques Books (1995) 

30 (ISBN:1881299139) , the disclosure of which is 

incorporated herein by reference in its entirety. 

Isothermal amplification approaches, such as 
rolling circle amplification, are also now well- 
described. See, e.g., Schweitzer et al . , Curr. Opin. 

35 Biotechnol. 12(l):21-7 (2001); U.S. Patent Nos . 
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6,235,502, 6,221,603, 6,210,884, 6,183,960, 5,854,033, 
5,714,320, 5,648,245, and international patent 
publications WO 97/19193 and WO 00/15779, the disclosures 
of which are incorporated herein by reference in their 
5 entireties. Rolling circle amplification can be combined 
with other techniques to facilitate SNP detection. See, 
e.g., Lizardi et al . , Nature Genet. 19(3):225-32 (1998). 

As further described below, nucleic acids of 
the present invention, inserted into vectors that flank 

10 the nucleic acid insert with a phage promoter, such as 
T7, T3, or SP6 promoter,' can be used to drive in vitro 
expression of RNA complementary to either strand of the 
nucleic acid of the present invention. The RNA can be 
used, inter alia, as a single -stranded probe, in cDNA- 

15 mRNA subtraction, or for in vitro translation. 

As will be further discussed herein below, 
nucleic acids of the present invention that encode AMLP1 
protein or portions thereof can be used, inter alia, to 
express the AMLP1 proteins or protein fragments, either 

20 alone, or as part of fusion proteins. 

Expression can be from genomic nucleic acids of 
the present invention, or from transcript -derived nucleic 
acids of the present invention. 

Where protein expression is effected from 

25 genomic DNA, expression will typically be effected in 
eukaryotic, typically mammalian, cells capable of 
splicing introns from the initial RNA transcript. 
Expression can be driven from episomal vectors, such as 
EBV-based vectors, or can be effected from genomic DNA 

30 integrated into a host cell chromosome. As will be more 
fully described below, where expression is from 
transcript -derived (or otherwise intron-less) nucleic 
acids of the present invention, expression can be 
effected in wide variety of prokaryotic or eukaryotic 

35 cells. 
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Expressed in vitro, the protein, protein 
fragment, or protein fusion can thereafter be isolated, 
to be used, inter alia, as a standard in immunoassays 
specific for the proteins, or protein isoforms, of the 
5 present invention; to be used as a therapeutic agent, 
e.g., to be administered as passive replacement therapy 
in individuals deficient in the proteins of the present 
invention, or to be administered as a vaccine; to be used 
for in vitro production of specific antibody, the 

10 antibody thereafter to be used, e.g., as an analytical 

reagent for detection and quantitation of the proteins of 
the present invention or to be used as an 
immunotherapeutic agent. 

The isolated nucleic acids of the present 

15 invention can also be used to drive in vivo expression of 
the proteins of the present invention. In vivo 
expression can be driven from a vector — typically a 
viral vector, often a vector based upon a replication 
incompetent retrovirus, an adenovirus, or an adeno- 

20 associated virus (AAV) — for purpose of gene therapy. In 
vivo expression can also be driven from signals 
endogenous to the nucleic acid or from a vector, often a 
plasmid vector, such as pVAXl (Invitrogen, Carlsbad CA, 
USA) , for purpose of "naked" nucleic acid vaccination, as 

25 further described in U.S. Pat. Nos. 5,589,466; 5,679,647; 
5,804,566; 5,830,877; 5,843,913; 5,880,104; 5,958,891; 
5,985,847; 6,017,897; 6,110,898; 6,204,250, the 
disclosures of which are incorporated herein by reference 
in their entireties. 

30 The nucleic acids of the present invention can 

also be used for antisense inhibition of transcription or 
translation. See Phillips (ed.), Antisense Technology, 
Part B , Methods in Enzymology Vol. 314, Academic Press, 
Inc. (1999) (ISBN: 012182215X) ; Phillips (ed.), Antisense 

35 Technology, Part A , Methods in Enzymology Vol. 313, 
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Academic Press, Inc. (1999) (ISBN: 0121822141); Hartmann 
et al. (eds.), Manual of Antisense Methodology 
(Perspectives in Antisense Science) , Kluwer Law 
International (1999) (ISBN: 079238539X) ; Stein et al . 
5 (eds.), Applied Antisense Oligonucleotide Technology , 
Wiley-Liss (cover (1998) (ISBN: 0471172790); Agrawal et 
al. (eds.), Antisense Research and Application , 
Springer-Verlag New York, Inc. (1998) (ISBN: 3540638334); 
Lichtenstein et al . (eds.), Antisense Technology: A 

10 Practical Approach , Vol. 185, Oxford University Press, 
INC. (1998) (ISBN: 0199635838); Gibson (ed.), Antisense 
and Ribozyme Methodology: Laboratory Companion , Chapman & 
Hall (1997) (ISBN: 3826100794); Chadwick et al . (eds.), 
Oligonucleotides as Therapeutic Agents - Symposium No. 

15 209, John Wiley & Son Ltd (1997) (ISBN: 0471972797), the 
disclosures of which are incorporated herein by reference 
in their entireties. 

Nucleic acids of the present invention, 
particularly cDNAs of the present invention, that encode 

20 full-length AMLP1 protein isoforms, have additional, 
well-recognized, immediate, real world utility as 
commercial products of manufacture suitable for sale. 

For example, Invitrogen Corp. (Carlsbad, CA, 
USA) , through its Research Genetics subsidiary, sells 

25 full length human cDNAs cloned into one of a selection of 
expression vectors as GeneStorm® expression-ready clones; 
utility is specific for the gene, since each gene is 
capable of being ordered separately and has a distinct 
catalogue number, and utility is substantial, each clone 

30 selling for $650.00 US. Similarly, Incyte Genomics (Palo 
Alto, CA, USA) sells clones from public and proprietary 
sources in multi-well plates or individual tubes. 

Nucleic acids of the present invention that 
include genomic regions encoding the AMLP1 protein, or 

35 portions thereof, have yet further utilities. 
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For example, genomic nucleic acids of the 
present invention can be used as amplification 
substrates, e.g. for preparation of genome -de rived single 
exon probes of the present invention, as described above 
5 and in copending and commonly-owned U.S. patent 
application nos . 09/864,761, filed May 23, 2001, 
09/774,203, filed January 29, 2001, and 09/632,366, filed 
August 3, 2000, the disclosures of which are incorporated 
herein by reference in their entireties. 

10 As another example, genomic nucleic acids of 

the present invention can be integrated non-homologously 
into the genome of somatic cells, e.g. CHO cells, COS 
cells, or 293 cells, with or without amplification of the 
insertional locus, in order, e.g., to create stable cell 

15 lines capable of producing the proteins of the present 
invention. 

As another example, more fully described herein 
below, genomic nucleic acids of the present invention can 
be integrated nonhomologously into embryonic stem (ES) 

20 cells to create transgenic non-human animals capable of 
producing the proteins of the present invention. 

Genomic nucleic acids of the present invention 
can also be used to target homologous recombination to 
the AMLP1 locus. See, e.g., U.S. Patent Nos. 6,187,305; 

25 6,204,061; 5,631,153; 5,627,059; 5,487,992; 5,464,764; 
5,614,396; 5,527,695 and 6,063,630; and Kmiec et al . 
(eds.), Gene Targeting Protocols , Vol. 133, Humana Press 
(2000) (ISBN: 0896033600); Joyner (ed.), Gene Targeting: 
A Practical Approach , Oxford University Press, Inc. 

30 (2000) (ISBN: 0199637938); Sedivy et al . , Gene Targeting , 
Oxford University Press (1998) (ISBN: 071677013X) ; Tymms 
et al. (eds.), Gene Knockout Protocols , Humana Press 
(2000) (ISBN: 0896035727); Mak et al . (eds.), The Gene 
Knockout FactsBook , Vol. 2, Academic Press, Inc. (1998) 

35 (ISBN: 0124660444); Torres et al . , Laboratory Protocols 
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for Conditional Gene Targeting , Oxford University Press 
(1997) (ISBN: 019963677X) ; Vega (ed.) # Gene Targeting , 
CRC Press, LLC (1994) (ISBN: 084938950X) , the disclosures 
of which are incorporated herein by reference in their 
5 entireties. 

Where the genomic region includes transcription 
regulatory elements, homologous recombination can be used 
to alter the expression of AMLP1, both for purpose of in 
vitro production of AMLP1 protein from human cells, and 

10 for purpose of gene therapy. See, e.g., U.S. Pat. Nos. 
5,981,214, 6,048,524; 5,272,071. 

Fragments of the nucleic acids of the present 
invention smaller than those typically used for 
homologous recombination can also be used for targeted 

15 gene correction or alteration, possibly by cellular 

mechanisms different from those engaged during homologous 
recombination . 

For example, partially duplexed RNA/DNA 
chimeras have been shown to have utility in targeted gene 

20 correction, U.S. Pat. Nos. 5,945,339, 5,888,983, 

5,871,984, 5,795,972, 5,780,296, 5,760,012, 5,756,325, 
5,731,181, the disclosures of which are incorporated 
herein by reference in their entireties. So too have 
small oligonucleotides fused to triplexing domains have 

25 been shown to have utility in targeted gene correction, 
Culver et al . , "Correction of chromosomal point mutations 
in human cells with bifunctional oligonucleotides," 
Nature Biotechnol . 17 (10) : 989-93 (1999), as have 
oligonucleotides having modified terminal bases or 

30 modified terminal internucleoside bonds, Gamper et al . , 

Nucl. Acids Res. 28 (21) : 4332-9 (2000), the disclosures of 
which are incorporated herein by reference. 

The isolated nucleic acids of the present 
invention can also be used to provide the initial 

35 substrate for recombinant engineering of AMLP1 protein 
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variants having desired phenotypic improvements. Such 
engineering includes, for example, site-directed 
mutagenesis, random mutagenesis with subsequent 
functional screening, and more elegant schemes for 
5 recombinant evolution of proteins, as are described, 
inter alia, in U.S. Pat. Nos. 6,180,406; 6,165,793; 
6,117,679; and 6,096,548, the disclosures of which are 
incorporated herein by reference in their entireties. 

Nucleic acids of the present invention can be 

10 obtained by using the labeled probes of the present 

invention to probe nucleic acid samples, such as genomic 
libraries, cDNA libraries, and mRNA samples, by standard 
techniques. Nucleic acids of the present invention can 
also be obtained by amplification, using the nucleic acid 

15 primers of the present invention, as further demonstrated 
in Example 1, herein below. Nucleic acids of the present 
invention of fewer than about 100 nt can also be 
synthesized chemically, typically by solid phase 
synthesis using commercially available automated 

20 synthesizers. 

"Full Length" AMLP1 Nucleic Acids 

In a first series of nucleic acid embodiments, 
25 the invention provides isolated nucleic acids that encode 
the entirety of the human AMLP1 protein. As discussed 
above, the "full -length" nucleic acids of the present 
invention can be used, inter alia, to express full length 
human AMLPl protein. The full-length nucleic acids can 
30 also be used as nucleic acid probes; used as probes, the 
isolated nucleic acids of these embodiments will 
hybridize to human AMLPl. 

In a first such embodiment, the invention 
provides an isolated nucleic acid comprising (i) the 
35 nucleotide sequence of SEQ ID NO: 1, or (ii) the 
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complement of (i) . SEQ ID NO: 1 presents the entire cDNA 
of human AMLPla, including the 5' untranslated (UT) 
region and 3 • UT. 

In a second embodiment, the invention provides 
5 an isolated nucleic acid comprising (i) the nucleotide 
sequence of SEQ ID NO: 2, (ii) a degenerate variant of 
the nucleotide sequence of SEQ ID NO: 2, or (iii) the 
complement of (i) or (ii) . SEQ ID NO: 2 presents the open 
reading frame (ORF) from SEQ ID NO: 2. 

10 In a third embodiment, the invention provides 

an isolated nucleic acid comprising (i) a nucleotide 
sequence that encodes a polypeptide with the amino acid 
sequence of SEQ ID NO: 3 or (ii) the complement of a 
nucleotide sequence that encodes a polypeptide with the 

15 amino acid sequence of SEQ ID NO: 3. SEQ ID NO: 3 
provides the amino acid sequence of human AMLPla. 

In a fourth embodiment, the invention provides 
an isolated nucleic acid having a nucleotide sequence 
that (i) encodes a polypeptide having the sequence of SEQ 

20 ID NO: 3, (ii) encodes a polypeptide having the sequence 
of SEQ ID NO: 3 with conservative amino acid 
substitutions, or (iii) that is the complement of (i) or 
(ii) , where SEQ ID NO: 3 provides the amino acid sequence 
of human AMLPla. 

25 In another such embodiment, the invention 

provides an isolated nucleic acid comprising (i) the 
nucleotide sequence of SEQ ID NO: 4, or (ii) the 
complement of (i) . SEQ ID NO: 4 presents the entire cDNA 
of human AMLPlb, including the 5' untranslated (UT) 

30 region and 3 1 UT. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) the nucleotide 
sequence of SEQ ID NO: 5, (ii) a degenerate variant of 
the nucleotide sequence of SEQ ID NO: 5, or (iii) the 
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complement of (i) or (ii) • SEQ ID NO: 5 presents the open 
reading frame (ORF) from SEQ ID NO: 5. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) a nucleotide 
5 sequence that encodes a polypeptide with the amino acid 
sequence of SEQ ID NO: 6 or (ii) the complement of a 
nucleotide sequence that encodes a polypeptide with the 
amino acid sequence of SEQ ID NO: 6. SEQ ID NO: 6 
provides the amino acid sequence of human AMLPla. 

10 In another embodiment, the invention provides 

an isolated nucleic acid having a nucleotide sequence 
that (i) encodes a polypeptide having the sequence of SEQ 
ID NO: 6, (ii) encodes a polypeptide having the sequence 
of SEQ ID NO: 6 with conservative amino acid 

15 substitutions, or (iii) that is the complement of (i) or 
(ii) , where SEQ ID NO: 6 provides the amino acid sequence 
of human AMLPla. 



Selected Partial Nucleic Acids 

20 

In a second series of nucleic acid embodiments, 
the invention provides isolated nucleic acids that encode 
select portions of human AMLP1 . As will be further 
discussed herein below, these "partial" nucleic acids can 

25 be used, inter alia, to express specific portions of the 
human AMLP1 . These "partial" nucleic acids can also be 
used, inter alia, as nucleic probes. 

In a first such embodiment, the invention 
provides an isolated nucleic acid comprising (i) the 

30 nucleotide sequence of SEQ ID NO: 7, (ii) a degenerate 
variant of SEQ ID NO: 7, or (iii) the complement of (i) 
or (ii) , wherein the isolated nucleic acid is no more 
than about 100 kb in length, typically no more than about 
75 kb in length, more typically no more than about 50 kb 

35 length. SEQ ID NO: 7 encodes a novel portion of AMLP1 . 
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Often, the isolated nucleic acids of this embodiment are 
no more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than about 
10 kb in length. 
5 In another embodiment, the invention provides 

an isolated nucleic acid comprising (i) a nucleotide 
sequence that encodes SEQ ID NO: 8 or (ii) the complement 
of a nucleotide sequence that encodes SEQ ID NO: 8, 
wherein the isolated nucleic acid is no more than about 

10 100 kb in length, typically no more than about 75 kb in 
length, frequently no more than about 50 kb in length. 
SEQ ID NO: 8 is the amino acid sequence encoded by the 
portion of AMLP1 not found in any EST fragments. Often, 
the isolated nucleic acids of this embodiment are no more 

15 than about 25 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb in 
length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising (i) a nucleotide 

20 sequence that encodes SEQ ID NO: 8, (ii) a nucleotide 
sequence that encodes SEQ ID NO: 8 with conservative 
substititions, or (iii) the complement of (i) or (ii) , 
wherein the isolated nucleic acid is no more than about 
100 kb in length, typically no more than about 75 kb in 

25 length, and often no more than about 50 kb in length. 

Often, the isolated nucleic acids of this embodiment are 
no more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than about 
10 kb in length. 

30 In another such embodiment, the invention 

provides an isolated nucleic acid comprising (i) the 
nucleotide sequence of SEQ ID NO: 33, (ii) or the 
complement of (i) , wherein the isolated nucleic acid is 
no more than about 100 kb in length, typically no more 

35 than about 75 kb in length, more typically no more than 
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about 50 kb length. SEQ ID NO: 33 encodes the novel exon 
of AMLPlb not found in AMLPla or any EST sequences. 
Often, the isolated nucleic acids of this embodiment are 
no more than about 25 kb in length, often no more than 
5 about 15 kb in length, and frequently no more than about 
10 kb in length. 



Cross-Hybridizing Nucleic Acids 

10 In another series of nucleic acid embodiments, 

the invention provides isolated nucleic acids that 
hybridize to various of the human AMLP1 nucleic acids of 
the present invention. These cross-hybridizing nucleic 
acids can be used, inter alia, as probes for, and to 

15 drive expression of, proteins that are related to human 
AMLP1 of the present invention as further isoforms, 
homologues, paralogues, or orthologues. 

In a first such embodiment, the invention 
provides an isolated nucleic acid comprising a sequence 

20 that hybridizes under high stringency conditions to a 
probe the nucleotide sequence of which consists of at 
least 17 nt, 18, 19, 20, 21, 22, 23, 24, 25, 30, 40, or 
50 nt of SEQ ID NO: 7 or the complement of SEQ ID NO: 7, 
wherein the isolated nucleic acid is no more than about 

25 100 kb in length, typically no more than about 75 kb in 
length, and often no more than about 50 kb in length. 
Often, the isolated nucleic acids of this embodiment are 
no more than about 25 kb in length, often no more than 
about 15 kb in length, and frequently no more than about 

30 10 kb in length. 

In a further embodiment, the invention provides 
an isolated nucleic acid comprising a sequence that 
hybridizes under moderate stringency conditions to a 
probe the nucleotide sequence of which consists of at 

35 least 17 nt, 18, 19, 20, 21, 22, 23, 24, 25, 30, 40, or 



WO 03/037931 PCT/US02/35129 

50 nt of SEQ ID NO: 7 or the complement of SEQ ID NO: 7 , 
wherein the isolated nucleic acid is no more than about 
100 kb in length, typically no more than about 75 kb in 
length, and often no more than about 50 kb in length. 
5 Often, the isolated nucleic acids of this embodiment are 
no more than about 2 5 kb in length, often no more than 
about 15 kb in length, and frequently no more than about 
10 kb in length. 

In a further embodiment, the invention provides 

10 an isolated nucleic acid comprising a sequence that 
hybridizes under high stringency conditions to a 
hybridization probe the nucleotide sequence of which (i) 
encodes a polypeptide having the sequence of SEQ ID NO: 
8, (ii) encodes a polypeptide having the sequence of SEQ 

15 ID NO: 8 with conservative amino acid substitutions, or 
(iii) is the complement of (i) or (ii) , wherein the 
isolated nucleic acid is no more than about 100 kb in 
length, typically no more than about 75 kb in length, and 
often no more than about 50 kb in length. Often, the 

20 isolated nucleic acids of this embodiment are no more 
than about 25 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb in 
length. 

In a first such embodiment, the invention 
25 provides an isolated nucleic acid comprising a sequence 
that hybridizes under high stringency conditions to a 
probe the nucleotide sequence of which consists of at 
least 17 nt, 18, 19, 20, 21, 22, 23, 24, 25, 30, 40, or 
50 nt of SEQ ID NO: 33 or the complement of SEQ ID NO: 
30 33, wherein the isolated nucleic acid is no more than 
about 100 kb in length, typically no more than about 75 
kb in length, and often no more than about 50 kb in 
length. Often, the isolated nucleic acids of this 
embodiment are no more than about 25 kb in length, often 
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no more than about 15 kb in length, and frequently no 
more than about 10 kb in length. 

In a further embodiment, the invention provides 
an isolated nucleic acid comprising a sequence that 
5 hybridizes under moderate stringency conditions to a 
probe the nucleotide sequence of which consists of at 
least 17 nt, 18, 19, 20, 21, 22, 23, 24, 25, 30, 40, or 
50 nt of SEQ ID NO: 33 or the complement of SEQ ID NO: 
33, wherein the isolated nucleic acid is no more than 

10 about 100 kb in length, typically no more than about 75 
kb in length, and often no more than about 50 kb in 
length. Often, the isolated nucleic acids of this 
embodiment are no more than about 25 kb in length, often 
no more than about 15 kb in length, and frequently no 

15 more than about 10 kb in length. 

Particularly Useful Nucleic Acids 

Particularly useful among the above -described 
20 nucleic acids are those that are expressed, or the 
complement of which are expressed, in brain, liver, 
kidney, and adrenal gland, as well as prostate, testis, 
lung, placenta, skeletal muscle, heart, bone marrow and 
colon tumor. 

25 Also particularly useful among the above- 

described nucleic acids are those that encode, or the 
complement of which encode, a polypeptide as an adaptor 
protein that interacts with both angiostatin-like protein 
and components of the actin cytoskeleton and has anti- 

30 angiogenesis activity. 

Other particularly useful embodiments of the 
nucleic acids above-described are those that encode, or 
the complement of which encode, a polypeptide having a 
partial Myosin-tail motif. 
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In another series of nucleic acid embodiments, 
the invention provides fragments of various of the 
5 isolated nucleic acids of the present invention which 
prove useful, inter alia, as nucleic acid probes, as 
amplification primers, and to direct expression or 
synthesis of epitopic or immunogenic protein fragments. 
In a first such embodiment, the invention 

10 provides an isolated nucleic acid comprising at least 17 
nucleotides, 18 nucleotides, 20 nucleotides, 24 
nucleotides, or 25 nucleotides of (i) SEQ ID NO: 7, (ii) 
a degenerate variant of SEQ ID NO: 7, or (iii) the 
complement of (i) or (ii) , wherein the isolated nucleic 

15 acid is no more than about 100 kb in length, typically no 
more than about 75 kb in length, more typically no more 
than about 50 kb in length. Often, the isolated nucleic 
acids of this embodiment are no more than about 25 kb in 
length, often no more than about 15 kb in length, and 

20 frequently no more than about 10 kb in length. 

The invention also provides an isolated nucleic 
acid comprising (i) a nucleotide sequence that encodes a 
peptide of at least 8 contiguous amino acids of SEQ ID 
NO: 8, (ii) a nucleotide sequence that encodes a peptide 

25 of at least 15 contiguous amino acids of SEQ ID NO: 8, or 
(iii) the complement of (i) or (ii) , wherein the isolated 
nucleic acid is no more than about 100 kb in length, 
typically no more than about 75 kb in length, more 
typically no more than about 50 kb in length. Often, the 

30 isolated nucleic acids of this embodiment are no more 
than about 25 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb in 
length. 

The invention also provides an isolated nucleic 
35 acid comprising a nucleotide sequence that encodes (i) a 
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polypeptide having the sequence of at least 8 contiguous 
amino acids of SEQ ID NO: 8 with conservative amino acid 
substitutions, (ii) a polypeptide having the sequence of 
at least 15 contiguous amino acids of SEQ ID NO: 8 with 
5 nservative amino acid substitutions, (iii) a polypeptide 
having the sequence of at least 8 contiguous amino acids 
of SEQ ID NO: 8 with moderately conservative 
substitutions, (iv) a polypeptide having the sequence of 
at last 15 congiuous amino acids of SEQ ID NO: 8 with 

10 moderately conservative substitutions, or (v) the 

complement of any of (i) - (iv) , wherein the isolated 
nucleic acid is no more than about 100 kb in length, 
typically no more than about 75 kb in length, more 
typically no more than about 50 kb in length. Often, the 

15 isolated nucleic acids of this embodiment are no more 
than about 25 kb in length, often no more than about 15 
kb in length, and frequently no more than about 10 kb in 
length. 

In another such embodiment, the invention 
20 provides an isolated nucleic acid comprising at least 17 
nucleotides, 18 nucleotides, 20 nucleotides, 24 
nucleotides, or 25 nucleotides of (i) SEQ ID NO: 33, (ii) 
a degenerate variant of SEQ ID NO: 33, or (iii) the 
complement of (i) or (ii) , wherein the isolated nucleic 
25 acid is no more than about 100 kb in length, typically no 
more than about 75 kb in length, more typically no more 
than about 50 kb in length. Often, the isolated nucleic 
acids of this embodiment are no more than about 25 kb in 
length, often no more than about 15 kb in length, and 
30 frequently no more than about 10 kb in length. 

Single Exon Probes 



35 



The invention further provides genome -derived 
single exon probes having portions of no more than one 
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exon of the AMLP1 gene. As further described in commonly 
owned and copending U.S. patent application serial no. 
09/632,366, filed August 3, 2000 ("Methods and Apparatus 
for High Throughput Detection and Characterization of 
5 alternatively Spliced Genes"), the disclosure of which is 
incorporated herein by reference in its entirety, such 
single exon probes have particular utility in identifying 
and characterizing splice variants. In particular, such 
single exon probes are useful for identifying and 
10 discriminating the expression of distinct isoforms of 
AMLP1. 

In a first embodiment, the invention provides 
an isolated nucleic acid comprising a nucleotide sequence 
of no more than one portion of SEQ ID NOs : 9-20 and SEQ 

15 ID NO: 33 or the complement of SEQ ID NOs: 9-20 and SEQ 
ID NO: 33, wherein the portion comprises at least 17 
contiguous nucleotides, 18 contiguous nucleotides, 20 
contiguous nucleotides, 24 contiguous nucleotides, 25 
contiguous nucleotides, or 50 contiguous nucleotides of 

20 any one of SEQ ID NOs : 9-22 and SEQ ID NO: 33, or their 
complement. In a further embodiment, the exonic portion 
comprises the entirety of the referenced SEQ ID NO: or 
its complement. 

In other embodiments, the invention provides 

25 isolated single exon probes having the nucleotide 

sequence of any one of SEQ ID NOs : 21 - 32 and SEQ ID NO: 
34. 



Transcription Control Nucleic Acids 

30 

In another aspect, the present invention 
provides genome -derived isolated nucleic acids that 
include nucleic acid sequence elements that control 
transcription of the AMLP1 gene. These nucleic acids can 
35 be used, inter alia, to drive expression of heterologous 
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coding regions in recombinant constructs, thus conferring 
upon such heterologous coding regions the expression 
pattern of the native AMLP1 gene. These nucleic acids 
can also be used, conversely, to target heterologous 
5 transcription control elements to the AMLP1 genomic 

locus, altering the expression pattern of the AMLP1 gene 
itself. 

In a first such embodiment, the invention 
provides an isolated nucleic acid comprising the 

10 nucleotide sequence of SEQ ID NO: 35 or its complement, 
wherein the isolated nucleic acid is no more than about 
100 kb in length, typically no more than about 75 kb in 
length, more typically no more than about 50 kb in 
length. Often, the isolated nucleic acids of this 

15 embodiment are no more than about 25 kb in length, often 
no more than about 15 kb in length, and frequently no 
more than about 10 kb in length. 

In another embodiment, the invention provides 
an isolated nucleic acid comprising at least 17, 18, 20, 

20 24, or 25 nucleotides of the sequence of SEQ ID NO: 35 or 
its complement, wherein the isolated nucleic acid is no 
more than about 100 kb in length, typically no more than 
about 75 kb in length, more typically no more than about 
50 kb in length. Often, the isolated nucleic acids of 

25 this embodiment are no more than about 25 kb in length, 
often no more than about 15 kb in length, and frequently 
no more than about 10 kb in length. 

VECTORS AND HOST CELLS 

30 

In another aspect, the present invention 
provides vectors that comprise one or more of the 
isolated nucleic acids of the present invention, and host 
cells in which such vectors have been introduced. 
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The vectors can be used, inter alia, for 
propagating the nucleic acids of the present invention in 
host cells (cloning vectors) , for shuttling the nucleic 
acids of the present invention between host cells derived 
5 from disparate organisms (shuttle vectors) , for inserting 
the nucleic acids of the present invention into host cell 
chromosomes (insertion vectors) , for expressing sense or 
antisense RNA transcripts of the nucleic acids of the 
present invention in vitro or within a host cell, and for 

10 expressing polypeptides encoded by the nucleic acids of 
the present invention, alone or as fusions to 
heterologous polypeptides. Vectors of the present 
invention will often be suitable for several such uses. 

Vectors are by now well-known in the art, and 

15 are described, inter alia, in Jones et al . (eds.), 
Vectors; Cloning Applications : Essential Techniques 
(Essential Techniques Series) , John Wiley & Son Ltd 1998 
(ISBN: 047196266X) ; Jones et al . (eds.), Vectors : 
Expression Systems : Essential Techniques (Essential 

20 Techniques Series) , John Wiley & Son Ltd, 1998 

(ISBN: 0471962678) ; Gacesa et al . , Vectors : Essential 
Data , John Wiley & Sons, 1995 (ISBN: 0471948411) ; 
Cid-Arregui (eds.), Viral Vectors: Basic Science and Gene 
Therapy , Eaton Publishing Co., 2000 (ISBN: 188129935X) ; 

25 Sambrook et al., Molecular Cloning: A Laboratory Manual 
(3 rd ed.), Cold Spring Harbor Laboratory Press, 2001 
(ISBN: 0879695773); Ausubel et al . (eds.), Short 
Protocols in Molecular Biology: A Compendium of Methods 
from Current Protocols in Molecular Biology (4 th ed.), 

30 John Wiley & Sons, 1999 (ISBN: 047132938X) , the 

disclosures of which are incorporated herein by reference 
in their entireties. Furthermore, an enormous variety of 
vectors are available commercially. Use of existing 
vectors and modifications thereof being well within the 
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skill in the art, only basic features need be described 
here. 

Typically, vectors are derived from virus, 
plasmid, prokaryotic or eukaryotic chromosomal elements, 
5 or some combination thereof, and include at least one 

origin of replication, at least one site for insertion of 
heterologous nucleic acid, typically in the form of a 
polylinker with multiple, tightly clustered, single 
cutting restriction sites, and at least one selectable 

10 marker, although some integrative vectors will lack an 

origin that is functional in the host to be chromosomally 
modified, and some vectors will lack selectable markers. 

Vectors of the present invention will further include at 
least one nucleic acid of the present invention inserted 

15 into the vector in at least one location. 

Where present, the origin of replication and 
selectable markers are chosen based upon the desired host 
cell or host cells; the host cells, in turn, are selected 
based upon the desired application. 

20 For example, prokaryotic cells, typically 

E. coli, are typically chosen for cloning. In such case, 
vector replication is predicated on the replication 
strategies of coliform- infecting phage — such as phage 
lambda, M13, T7, T3 and PI — or on the replication origin 

25 of autonomously replicating episomes, notably the ColEl 
plasmid and later derivatives, including pBR322 and the 
pUC series plasmids. Where E. coli is used as host, 
selectable markers are, analogously, chosen for 
selectivity in gram negative bacteria: e.g., typical 

30 markers confer resistance to antibiotics, such as 

ampicillin, tetracycline, chloramphenicol, kanamycin, 
streptomycin, zeocin; auxotrophic markers can also be 
used. 

As another example, yeast cells, typically S. 
35 cerevisiae, are chosen, inter alia, for eukaryotic 
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genetic studies, due to the ease of targeting genetic 
changes by homologous recombination and to the ready 
ability to complement genetic defects using recombinantly 
expressed proteins, for identification of interacting 
5 protein components, e.g. through use of a two-hybrid 
system, and for protein expression. Vectors of the 
present invention for use in yeast will typically, but 
not invariably, contain an origin of replication suitable 
for use in yeast and a selectable marker that is 

10 functional in yeast. 

Integrative Yip vectors do not replicate 
autonomously, but integrate, typically in single copy, 
into the yeast genome at low frequencies and thus 
replicate as part of the host cell chromosome; these 

15 vectors lack an origin of replication that is functional 
in yeast, although they typically have at least one 
origin of replication suitable for propagation of the 
vector in bacterial cells. YEp vectors, in contrast, 
replicate episomally and autonomously due to presence of 

20 the yeast 2 micron plasmid origin (2 Jim ori) . The YCp 
yeast centromere plasmid vectors are autonomously 
replicating vectors containing centromere sequences, CEN, 
and autonomously replicating sequences, ARS; the ARS 
sequences are believed to correspond to the natural 

25 replication origins of yeast chromosomes. YACs are based 
on yeast linear plasmids, denoted YLp, containing 
homologous or heterologous DNA sequences that function as 
telomeres (TEL) in vivo, as well as containing yeast ARS 
(origins of replication) and CEN (centromeres) segments. 

30 Selectable markers in yeast vectors include a 

variety of auxotrophic markers, the most common of which 
are (in Saccharomyces cerevisiae) URA3, HIS3, LEU2, TRP1 
and LYS2, which complement specific auxotrophic 
mutations, such as ura3-52, his3-Dl, Ieu2-Dl, trpl-Dl and 

35 lys2-201. The URA3 and LYS2 yeast genes further permit 
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negative selection based on specific inhibitors, 
5-f luoro-orotic acid (FOA) and a-aminoadipic acid (aAA) , 
respectively, that prevent growth of the prototrophic 
strains but allows growth of the ura3 and lys2 mutants, 
5 respectively. Other selectable markers confer resistance 
to, e.g., zeocin. 

As yet another example, insect cells are often 
chosen for high efficiency protein expression. Where the 
host cells are from Spodoptera frugiperda - e.g., Sf9 and 

TM 

10 Sf21 cell lines, and expresSF cells (Protein Sciences 
Corp., Meriden, CT, USA) — the vector replicative 
strategy is typically based upon the baculovirus life 
cycle. Typically, baculovirus transfer vectors are used 
to replace the wild- type AcMNPV polyhedrin gene with a 

15 heterologous gene of interest. Sequences that flank the 
polyhedrin gene in the wild-type genome are positioned 5' 
and 3' of the expression cassette on the transfer 
vectors. Following cotransf ection with AcMNPV DNA, a 
homologous recombination event occurs between these 

20 sequences resulting in a recombinant virus carrying the 
gene of interest and the polyhedrin or plO promoter. 
Selection can be based upon visual screening for lacZ 
fusion activity. 

As yet another example, mammalian cells are 

25 often chosen for expression of proteins intended as 

pharmaceutical agents, and are also chosen as host cells 
for screening of potential agonist and antagonists of a 
protein or a physiological pathway. 

Where mammalian cells are chosen as host cells, 

30 vectors intended for autonomous extrachromosomal 

replication will typically include a viral origin, such 
as the SV40 origin (for replication in cell lines 
expressing the large T-antigen, such as COS1 and COS7 
cells) , the papillomavirus origin, or the EBV origin for 
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long term episomal replication (for use, e.g., in 
293-EBNA cells, which constitutively express the EBV 
EBNA-1 gene product and adenovirus E1A) . Vectors 
intended for integration, and thus replication as part of 
5 the mammalian chromosome, can, but need not, include an 
origin of replication functional in mammalian cells, such 
as the SV4 0 origin. Vectors based upon viruses, such as 
adenovirus, adeno-associated virus, vaccinia virus, and 
various mammalian retroviruses, will typically replicate 

10 according to the viral replicative strategy. 

Selectable markers for use in mammalian cells 
include resistance to neomycin (G418) , blast icidin, 
hygromycin and to zeocin, and selection based upon the 
purine salvage pathway using HAT medium. 

15 Plant cells can also be used for expression, 

with the vector replicon typically derived from a plant 
virus (e.g., cauliflower mosaic virus, CaMV; tobacco 
mosaic virus, TMV) and selectable markers chosen for 
suitability in plants. 

20 For propagation of nucleic acids of the present 

invention that are larger than can readily be accomodated 
in vectors derived from plasmids or virus, the invention 
further provides artificial chromosomes - BACs, YACs, 
PACs, and HACs — that comprise AMLP1 nucleic acids, often 

25 genomic nucleic acids. 

The BAC system is based on the 
well-characterized E. coli F- factor, a low copy plasmid 
that exists in a supercoiled circular form in host cells. 
The structural features of the F-factor allow stable 

30 maintenance of individual human DNA clones as well as 

easy manipulation of the cloned DNA. See Shizuya et al . , 
Keio J. Med. 50(l):26-30 (2001); Shizuya et al . , Proc. 
Natl. Acad. Sci. USA 89 (18) : 8794-7 (1992). 

YACs are based on yeast linear plasmids, 

35 denoted YLp, containing homologous or heterologous DNA 
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sequences that function as telomeres (TEL) in vivo, as 
well as containing yeast ARS (origins of replication) and 
CEN (centromeres) segments. 

HACs are human artifical chromosomes. Kuroiwa 
5 et al., Mature Biotechnol. 18 (10) : 1086-90 (2000); Henning 
etal., Proc. Natl. Acad. Sci. USA 96(2):592-7 (1999); 
Harrington et al . , Nature Genet. 15(4):345-55 (1997). In 
one version, long synthetic arrays of alpha satellite DNA 
are combined with telomeric DNA and genomic DNA to 

10 generate linear microchromosomes that are mitotically and 
cytogenetically stable in the absence of selection. 

PACs are PI -derived artificial chromosomes. 
Sternberg, Proc. Natl. Acad. Sci. USA 87(l):103-7 (1990); 
Sternberg et al . , New Biol . 2(2):151-62 (1990); Pierce et 

15 al., Proc. Natl Acad. Sci. USA 89 (6) : 2056-60 (1992). 

Vectors of the present invention will also 
often include elements that permit in vitro transcription 
of RNA from the inserted heterologous nucleic acid. 

Such vectors typically include a phage promoter, such as 

20 that from T7, T3, or SP6 , flanking the nucleic acid 
insert. Often two different such promoters flank the 
inserted nucleic acid, permitting separate in vitro 
production of both sense and antisense strands. 

Expression vectors of the present invention - 

25 that is, those vectors that will drive expression of 

polypeptides from the inserted heterologous nucleic acid 
- will often include a variety of other genetic elements 
operatively linked to the protein-encoding heterologous 
nucleic acid insert, typically genetic elements that 

30 drive transcription, such as promoters and enhancer 

elements, those that facilitate RNA processing, such as 
transcription termination and/or polyadenylation signals, 
and those that facilitate translation, such as ribosomal 
consensus sequences. 
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For example, vectors for expressing proteins of 
the present invention in prokaryotic cells, typically E. 
coli, will include a promoter, often a phage promoter, 
such as phage lambda pL promoter, the trc promoter, a 
5 hybrid derived from the trp and lac promoters, the 

bacteriophage T7 promoter (in E. coli cells engineered to 
express the T7 polymerase) , or the araBAD operon. Often, 
such prokaryotic expression vectors will further include 
transcription terminators, such as the aspA terminator, 

10 and elements that facilitate translation, such as a 
consensus ribosome binding site and translation 
termination codon, Schomer et al . , Proc. Natl. Acad. Sci. 
USA 83:8506-8510 (1986) . 

As another example, vectors for expressing 

15 proteins of the present invention in yeast cells, 

typically S. cerevisiae, will include a yeast promoter, 
such as the CYC1 promoter, the GAL1 promoter, ADH1 
promoter, or the GPD promoter, and will typically have 
elements that facilitate transcription termination, such 

20 as the transcription termination signals from the CYC1 or 
ADH1 gene . 

As another example, vectors for expressing 
proteins of the present invention in mammalian cells will 
include a promoter active in mammalian cells. Such 

25 promoters are often drawn from mammalian viruses — such 
as the enhancer-promoter sequences from the immediate 
early gene of the human cytomegalovirus (CMV) , the 
enhancer-promoter sequences from the Rous sarcoma virus 
long terminal repeat (RSV LTR) , and the enhancer- 

30 promoter from SV40. Often, expression is enhanced by 

incorporation of polyadenylation sites, such as the late 
SV40 polyadenylation site and the polyadenylation signal 
and transcription termination sequences from the bovine 
growth hormone (BGH) gene, and ribosome binding sites. 

35 Furthermore, vectors can include introns, such as intron 
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Vector-drive protein expression can be 
constitutive or inducible. 
5 Inducible vectors include either naturally 

inducible promoters, such as the trc promoter, which is 
regulated by the lac operon, and the pL promoter, which 
is regulated by tryptophan, the MMTV-LTR promoter, which 
is inducible by dexamethasone, or can contain synthetic 

10 promoters and/or additional elements that confer 

inducible control on adjacent promoters. Examples of 
inducible synthetic promoters are the hybrid Plac/ara-1 
promoter and the PLtetO-1 promoter. The PltetO-1 
promoter takes advantage of the high expression levels 

15 from the PL promoter of phage lambda, but replaces the 
lambda repressor sites with two copies of operator 2 of 
the TnlO tetracycline resistance operon, causing this 
promoter to be tightly repressed by the Tet repressor 
protein and induced in response to tetracycline (Tc) and 

20 Tc derivatives such as anhydrotetracycline . 

As another example of inducible elements, 
hormone response elements, such as the glucocorticoid 
response element (GRE) and the estrogen response element 
(ERE) , can confer hormone inducibility where vectors are 

25 used for expression in cells having the respective 
hormone receptors. To reduce background levels of 
expression, elements responsive to ecdysone, an insect 
hormone, can be used instead, with coexpression of the 
ecdysone receptor. 

30 Expression vectors can be designed to fuse the 

expressed polypeptide to small protein tags that 
facilitate purification and/or visualization. 

For example, proteins of the present invention 
can be expressed with a polyhistidine tag that 

35 facilitates purification of the fusion protein by 
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immobilized metal affinity chromatography, for example 
using NiNTA resin (Qiagen Inc., Valencia, CA, USA) or 

TM 

TALON resin (cobalt immobilized affinity chromatography 
medium, Clontech Labs, Palo Alto, CA, USA) . As another 
5 example, the fusion protein can include a chit in-binding 
tag and self -excising intein, permitting chitin-based 

TM 

purification with self -removal of the fused tag (IMPACT 
system, New England Biolabs, Inc., Beverley, MA, USA). 
Alternatively, the fusion protein can include a 
10 calmodul in-binding peptide tag, permitting purification 
by calmodulin affinity resin (Stratagene, La Jolla, CA, 
USA) , or a specifically excisable fragment of the biotin 
carboxylase carrier protein, permitting purification of 
in vivo biotinylated protein using an avidin resin and 
15 subsequent tag removal (Promega, Madison, WI, USA) . As 
another useful alternative, the proteins of the present 
invention can be expressed as a fusion to glutathione-S- 
transf erase, the affinity and specificity of binding to 
glutathione permitting purification using glutathione 
20 affinity resins, such as Glutathione-Superf low Resin 
(Clontech Laboratories, Palo Alto, CA, USA) , with 
subsequent elution with free glutathione. 

Other tags include, for example, the Xpress 
epitope, detectable by ant i -Xpress antibody (Invitrogen, 
Carlsbad, CA, USA) , a myc tag, detectable by anti-myc tag 
antibody, the V5 epitope, detectable by anti-V5 antibody 
(Invitrogen, Carlsbad, CA, USA), FLAG® epitope, . 
detectable by anti-FLAG antibody (Stratagene, La Jolla, 
CA, USA) , and the HA epitope. 

For secretion of expressed proteins, vectors 
can include appropriate sequences that encode secretion 
signals, such as leader peptides. For example, the 
pSecTag2 vectors (Invitrogen, Carlsbad, CA, USA) are 5.2 
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kb mammalian expression vectors that carry the secretion 
signal from the V-J2-C region of the mouse Ig kappa-chain 
for efficient secretion of recombinant proteins from a 
variety of mammalian cell lines. 
5 Expression vectors can also be designed to fuse 

proteins encoded by the heterologous nucleic acid insert 
to polypeptides larger than purification and/or 
identification tags. Useful protein fusions include 
those that permit display of the encoded protein on the 
10 surface of a phage or cell, fusions to intrinsically 
fluorescent proteins, such as those that have a green 
fluorescent protein (GFP)-like chromophore, fusions to 
the IgG Fc region, and fusions for use in two hybrid 
systems . 

15 Vectors for phage display fuse the encoded 

polypeptide to, e.g., the gene III protein (pill) or gene 
VIII protein (pVIII) for display on the surface of 
filamentous phage, such as M13 . See Barbas et al . , Phage 
Display: A Laboratory Manual , Cold Spring Harbor 

20 Laboratory Press (2001) (ISBN 0-87969-546-3); Kay et al . 
(eds.), Phage Display of Peptides and Proteins: A 
Laboratory Manual , San Diego: Academic Press, Inc., 1996; 
Abelson et al. (eds.), Combinatorial Chemistry , Methods 
in Enzymology vol. 267, Academic Press (May 1996). 

25 Vectors for yeast display, e.g. the pYDl yeast 

display vector (Invitrogen, Carlsbad, CA, USA), use the 
a-agglutinin yeast adhesion receptor to display 
recombinant protein on the surface of S. cerevisiae. 
Vectors for mammalian display, e.g., the pDisplay™ 

30 vector (Invitrogen, Carlsbad, CA, USA), target 

recombinant proteins using an N- terminal cell surface 
targeting signal and a C-terminal transmembrane anchoring 
domain of platelet derived growth factor receptor. 
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A wide variety of vectors now exist that fuse 
proteins encoded by heterologous nucleic acids to the 
chromophore of the substrate- independent , intrinsically 
fluorescent green fluorescent protein from Aequorea 
5 victoria ("GFP") and its variants. These proteins are 
intrinsically fluorescent: the GFP- like chromophore is 
entirely encoded by its amino acid sequence and can 
fluoresce without requirement for cof actor or substrate. 
Structurally, the GFP- like chromophore 

10 comprises an 11-stranded p-barrel (P-can) with a central 
a-helix, the central a-helix having a conjugated n- 
resonance system that includes two aromatic ring systems 
and the bridge between them. The 7t-resonance system is 
created by autocatalytic cyclization among amino acids; 

15 cyclization proceeds through an imidazolinone 

intermediate, with subsequent dehydrogenation by 
molecular oxygen at the Ca-cp bond of a participating 
tyrosine . 

The GFP- like chromophore can be selected from 
20 GFP- like chromophores found in naturally occurring 
proteins, such as A. victoria GFP (GenBank accession 
number AAA27721) , Renilla reniformis GFP, FP583 (GenBank 
accession no. AF168419) (DsRed) , FP593 (AF272711) , FP483 
(AF168420) , FP484 (AF168424) , FP595 (AF246709) , FP486 
25 (AF168421) , FP538 (AF168423), andFP506 (AF168422) , and 
need include only so much of the native protein as is 
needed to retain the chromophore 1 s intrinsic 
fluorescence. Methods for determining the minimal domain 
required for fluorescence are known in the art. Li et 
30 al . , "Deletions of the Aequorea victoria Green Fluorescent 
Protein Define the Minimal Domain Required for 
Fluorescence," J. Biol. Chem. 272:28545-28549 (1997). 

Alternatively, the GFP- like chromophore can be 
selected from GFP-like chromophores modified from those 
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found in nature. Typically, such modifications are made 
to improve recombinant production in heterologous 
expression systems (with or without change in protein 
sequence) , to alter the excitation and/or emission 
5 spectra of the native protein, to facilitate 

purification, to facilitate or as a consequence of 
cloning, or are a fortuitous consequence of research 
investigation. 

The methods for engineering such modified GFP- 

10 like chromophores and testing them for fluorescence 

activity, both alone and as part of protein fusions, are 
well-known in the art. Early results of these efforts 
are reviewed in Heim et a!., Curr. Biol. 6:178-182 
(1996) , incorporated herein by reference in its entirety; 

15 a more recent review, with tabulation of useful 

mutations, is found in Palm et al . , "Spectral Variants of 
Green Fluorescent Protein," in Green Fluorescent 
Proteins , Conn (ed.), Methods Enzymol. vol. 302, pp. 378 
- 394 (1999), incorporated herein by reference in its 

20 entirety. A variety of such modified chromophores are 

now commercially available and can readily be used in the 
fusion proteins of the present invention. 

For example, EGFP ("enhanced GFP"), Cormack et 
al., Gene 173:33-38 (1996); U.S. Pat. Nos . 6,090,919 and 

25 5,804,387, is a red-shifted, human codon- optimized 
variant of GFP that has been engineered for brighter 
fluorescence, higher expression in mammalian cells, and 
for an excitation spectrum optimized for use in flow 
cytometers. EGFP can usefully contribute a GFP-like 

30 chromophore to the fusion proteins of the present 

invention. A variety of. EGFP vectors, both plasmid and 
viral, are available commercially (Clontech Labs, Palo 
Alto, CA, USA) , including vectors for bacterial 
expression, vectors for N-terminal protein fusion 
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expression, vectors for expression of C-terminal protein 
fusions, and for bicistronic expression. 

Toward the other end of the emission spectrum, 
EBFP ("enhanced blue fluorescent protein") and BFP2 
5 contain four amino acid substitutions that shift the 
emission from green to blue, enhance the brightness of 
fluorescence and improve solubility of the protein, Heim 
et al., Curr. Biol. 6:178-182 (1996); Cormack et al . , 
Gene 173:33-38 (1996). EBFP is optimized for expression 

10 in mammalian cells whereas BFP2, which retains the 

original jellyfish codons, can be expressed in bacteria; 
as is further discussed below, the host cell of 
production does not affect the utility of the resulting 
fusion protein. The GFP-like chromophores from EBFP and 

15 BFP2 can usefully be included in the fusion proteins of 
the present invention, and vectors containing these blue- 
shifted variants are available from Clontech Labs (Palo 
Alto, CA, USA) . 

Analogously, EYFP ("enhanced yellow fluorescent 

20 protein"), also available from Clontech Labs, contains 

four amino acid substitutions, different from EBFP, Ormo 
et al., Science 273:1392-1395 (1996), that shift the 
emission from green to yellowish-green. Citrine, an 
improved yellow fluorescent protein mutant, is described 

25 in Heikal et al., Proc. Natl. Acad. Sci . USA 97:11996- 

12001 (2000). ECFP ("enhanced cyan fluorescent protein") 
(Clontech Labs, Palo Alto, CA, USA) contains six amino 
acid substitutions, one of which shifts the emission 
spectrum from green to cyan. Heim et al., Curr. Biol. 

30 6:178-182 (1996); Miyawaki et al . , Nature 388:882-887 
(1997) . The GFP-like chromophore of each of these GFP 
variants can usefully be included in the fusion proteins 
of the present invention. 

The GFP-like chromophore can also be drawn from 

35 other modified GFPs, including those described in U.S. 
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Pat. Nos. 6,124,128; 6,096,865; 6,090,919; 6,066,476; 
6,054,321; 6,027,881; 5,968,750; 5,874,304; 5,804,387; 
5,777,079; 5,741,668; and 5,625,048, the disclosures of 
which are incorporated herein by reference in their 
5 entireties. See also Conn (ed.), Green Fluorescent 

Protein , Methods in Enzymol . Vol. 302, pp 378-394 (1999), 
incorporated herein by reference in its entirety. A 
variety of such modified chromophores are now 
commercially available and can readily be used in the 
10 fusion proteins of the present invention. 

Fusions to the IgG Fc region increase serum 
half life of protein pharmaceutical products through 
interaction with the FcRn receptor (also denominated the 
FcRp receptor and the Brambell receptor, FcRb) , further 
15 described in international patent application nos. WO 
97/43316, WO 97/34631, WO 96/32478, WO 96/18412. 

For long-term, high-yield recombinant 
production of the proteins, protein fusions, and protein 
fragments of the present invention, stable expression is 
20 particularly useful. 

Stable expression is readily achieved by 
integration into the host cell genome of vectors having 
selectable markers, followed by selection for integrants. 

For example, the pUB6/V5-His A, B, and C 
25 vectors (Invitrogen, Carlsbad, CA, USA) are designed for 
high-level stable expression of heterologous proteins in 
a wide range of mammalian tissue types and cell lines. 
pUB6/V5-His uses the promoter/enhancer sequence from the 
human ubiquitin C gene to drive expression of recombinant 
30 proteins: expression levels in 293, CHO, and NIH3T3 cells 
are comparable to levels from the CMV and human EF-la 
promoters. The bsd gene permits rapid selection of 
stably transfected mammalian cells with the potent 
antibiotic blasticidin. 
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Replication incompetent retroviral vectors, 
typically derived from Moloney murine leukemia virus, 
prove particularly useful for creating stable 
transfectants having integrated provirus. The highly 
5 efficient transduction machinery of retroviruses, coupled 
with the availability of a variety of packaging cell 

™ ™ 
lines - such as RetroPack PT 67, EcoPack2 -293, 

AmphoPack-293, GP2-293 cell lines (all available from 
Clontech Laboratories, Palo Alto, CA, USA) - allow a wide 

10 host range to be infected with high efficiency; varying 
the multiplicity of infection readily adjusts the copy 
number of the integrated provirus . Retroviral vectors 
are available with a variety of selectable markers, such 
as resistance to neomycin, hygromycin, and puromycin, 

15 permitting ready selection of stable integrants. 

The present invention further includes host 
cells comprising the vectors of the present invention, 
either present episomally within the cell or integrated, 

20 in whole or in part, into the host cell chromosome. 

Among other considerations, some of which are 
described above, a host cell strain may be chosen for its 
ability to process the expressed protein in the desired 
fashion. Such post- translational modifications of the 

25 polypeptide include, but are not limited to, acetylation, 
carboxylation, glycosylation, phosphorylation, 
lipidation, and acylation, and it is an aspect of the 
present invention to provide AMLP1 proteins with such 
post-translational modifications . 

30 As noted earlier, host cells can be prokaryotic 

or eukaryotic. Representative examples of appropriate 
host cells include, but are not limited to, bacterial 
cells, such as E. coli, Caulobacter crescentus, 
Streptomyces species, and Salmonella typhimurium; yeast 
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cells, such as Saccharomyces cerevisiae, 
Schizosaccharomyces pombe, Pichia pastoris, Pichia 
methanol ica; insect cell lines, such as those from 
Spodoptera frugiperda - e.g., Sf9 and Sf21 cell lines, 

TM 

5 and expresSF cells (Protein Sciences Corp., Meriden, CT, 
USA) - Drosophila S2 cells, and Trichoplusia ni High 
Five® Cells (Invitrogen, Carlsbad, CA, USA) ; and 
mammalian cells. Typical mammalian cells include COS1 
and COS7 cells, Chinese hamster ovary (CHO) cells, NIH 

10 3T3 cells, 293 cells, HEPG2 cells, HeLa cells, L cells, 
murine ES cell lines (e.g., from strains 129/SV, C57/BL6, 
DBA- 1 , 129/SVJ) , K562, Jurkat cells, andBW5147. Other 
mammalian cell lines are well known and readily available 
from the American Type Culture Collection (Manassas, VA, 

15 USA) and the National Institute of General medical 

Sciences (NIGMS) Human Genetic Cell Repository at the 
Coriell Cell Repositories (Camden, NJ, USA) . 

Methods for introducing the vectors and nucleic 
acids of the present invention into the host cells are 

20 well known in the art; the choice of technique will 
depend primarily upon the specific vector to be 
introduced and the host cell chosen. 

For example, phage lambda vectors will 
typically be packaged using a packaging extract (e.g., 

25 Gigapack® packaging extract, Stratagene, La Jolla, CA, 
USA), and the packaged virus used to infect E. coli. 
Plasmid vectors will typically be introduced into 
chemically competent or electrocompetent bacterial cells. 

30 E. coli cells can be rendered chemically 

competent by treatment, e.g., with CaCl 2 , or a solution 
of Mg 2+ , Mn 2 \ Ca 2+ , Rb + or K + , dimethyl sulfoxide, 
dithiothreitol, and hexamine cobalt (III), Hanahan, J. 
Mol. Biol. 166 (4) :557-80 (1983), and vectors introduced 
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by heat shock. A wide variety of chemically competent 
strains are also available commercially (e.g., Epicurian 
Coli® XLIO-Gold® Ultracompetent Cells (Stratagene, La 
Jolla, CA, USA) ; DH5ot competent cells (Clontech 
5 Laboratories, Palo Alto, CA, USA); TOP10 Chemically 

Competent E. coli Kit (Invitrogen, Carlsbad, CA, USA)). 

Bacterial cells can be rendered 
electrocompetent - that is, competent to take up 
exogenous DNA by electroporation - by various pre-pulse 

10 treatments; vectors are introduced by electroporation 
followed by subsequent outgrowth in selected media. An 
extensive series of protocols is provided online in 
Elect roprotocols (BioRad, Richmond, CA, USA) 
(http://www.bio- 

15 rad.com/LifeScience/pdf/New_Gene_Pulser.pdf) . 

Vectors can be introduced into yeast cells by 
spheroplasting, treatment with lithium salts, 
electroporation, or protoplast fusion. 

Spheroplasts are prepared by the action of 

20 hydrolytic enzymes - a snail-gut extract, usually denoted 
Glusulase, or Zymolyase, an enzyme from Arthrobacter 
luteus — to remove portions of the cell wall in the 
presence of osmotic stabilizers, typically 1 M sorbitol. 
DNA is added to the spheroplasts, and the mixture is 

25 co-precipitated with a solution of polyethylene glycol 
(PEG) and Ca 2+ . Subsequently, the cells are resuspended 
in a solution of sorbitol, mixed with molten agar and 
then layered on the surface of a selective plate 
containing sorbitol. For lithium-mediated 

30 transformation, yeast cells are treated with lithium 
acetate, which apparently permeabilizes the cell wall, 
DNA is added and the cells are co-precipitated with PEG. 

The cells are exposed to a brief heat shock, washed free 
of PEG and lithium acetate, and subsequently spread on 

35 plates containing ordinary selective medium. Increased 
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frequencies of transformation are obtained by using 
specially-prepared single-stranded carrier DNA and 
certain organic solvents. Schiestl et al., Curr. Genet. 
16(5-6) :339-46 (1989). For electroporation, 
5 freshly-grown yeast cultures are typically washed, 
suspended in an osmotic protectant, such as sorbitol, 
mixed with DNA, and the cell suspension pulsed in an 
electroporation device. Subsequently, the cells are 
spread on the surface of plates containing selective 

10 media. Becker et al . , Methods Enzymol. 194:182-7 (1991). 
The efficiency of transformation by electroporation can 
be increased over 100-fold by using PEG, single -stranded 
carrier DNA and cells that are in late log-phase of 
growth. Larger constructs, such as YACs, can be 

15 introduced by protoplast fusion. 

Mammalian and insect cells can be directly 
infected by packaged viral vectors, or transfected by 
chemical or electrical means. 

For chemical transfection, DNA can be 

20 coprecipitated with CaP0 4 or introduced using liposomal 

and nonliposomal lipid-based agents. Commercial kits are 
available for CaP0 4 transfection (CalPhos™ Mammalian 
Transfection Kit, Clontech Laboratories, Palo Alto, CA, 
USA) , and lipid-mediated transfection can be practiced 

25 using commercial reagents, such as LIPOFECTAMINE™ 2000, 
LIPOFECTAMINE™ Reagent, CELLFECTIN® Reagent, and 
LIPOFECTIN® Reagent (Invitrogen, Carlsbad, CA, USA) , 
DOTAP Liposomal Transfection Reagent, FuGENE 6, 
X-tremeGENE Q2, DOS PER f (Roche Molecular Biochemicals, 

30 Indianapolis, IN USA) , Effectene™, PolyFect®, Superfect® 
(Qiagen, Inc., Valencia, CA, USA). Protocols for 
electroporating mammalian cells can be found online in 
Electroprotocols (Bio-Rad, Richmond, CA, USA) 
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(http://www.bio- 

rad . com/Lif eScience/pdf /New_Gene_Pulser . pdf ) . 
See also, Norton et al . (eds.), Gene Transfer Methods: 
Introducing DNA into Living Cells and Organisms , 
5 BioTechniques Books, Eaton Publishing Co. (2000) (ISBN 
1-881299-34-1) , incorporated herein by reference in its 
entirety. 

Other transf ectlon techniques include 
transfection by particle embardment. See, e.g., Cheng et 
10 al., Proc. Natl. Acad. Sci. USA 90 (10) : 4455-9 (1993); 
Yang et al., Proc. Natl. Acad. Sci. USA 87 (24) : 9568-72 
(1990) . 



PROTEINS 

15 

In another aspect, the present invention 
provides AMLP1 proteins, various fragments thereof 
suitable for use as antigens (e.g., for epitope mapping) 
and for use as immunogens (e.g., for raising antibodies 

20 or as vaccines) , fusions of AMLP1 polypeptides and 

fragments to heterologous polypeptides, and conjugates of 
the proteins, fragments, and fusions of the present 
invention to other moieties (e.g., to carrier proteins, 
to f luorophores) . 

25 FIG. 3 and FIG. 4 presents the predicted amino 

acid sequences encoded by the AMLPla and AMLPlb cDNA 
clones, respectively. The amino acid sequences are 
further presented, respectively, in SEQ ID NO: 3 and SEQ 
ID NO: 6. 

30 Unless otherwise indicated, amino acid 

sequences of the proteins of the present invention were 
determined as a predicted translation from a nucleic acid 
sequence. Accordingly, any amino acid sequence presented 
herein may contain errors due to errors in the nucleic 

35 acid sequence, as described in detail above. 
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Furthermore, single nucleotide polymorphisms (SNPs) occur 
frequently in eukaryotic genomes - more than 1.4 million 
SNPs have already identified in the human genome, 
International Human Genome Sequencing Consortium, Nature 
5 409:860 - 921 (2001) - and the sequence determined from 
one individual of a species may differ from other allelic 
forms present within the population. Small deletions and 
insertions can often be found that do not alter the 
function of the protein. 

1.0 Accordingly, it is an aspect of the present 

invention to provide proteins not only identical in 
sequence to those described with particularity herein, 
but also to provide isolated proteins at least about 65% 
identical in sequence to those described with 

15 particularity herein, typically at least about 70%, 75%, 
80%, 85%, or 90% identical in sequence to those described 
with particularity herein, usefully at least about 91%, 
92%, 93%, 94%, or 95% identical in sequence to those 
described with particularity herein, usefully at least 

20 about 96%, 97%, 98%, or 99% identical in sequence to 
those described with particularity herein, and, most 
conservatively, at least about 99.5%, 99.6%, 99.7%, 99.8% 
and 99.9% identical in sequence to those described with 
particularity herein. These sequence variants can be 

25 naturally occurring or can result from human intervention 
by way of random or directed mutagenesis. 

For purposes herein, percent identity of two 
amino acid sequences is determined using the procedure of 
Tatiana et al . , "Blast 2 sequences - a new tool for 

30 comparing protein and nucleotide sequences", FEMS 

Microbiol Lett. 174:247-250 (1999), which procedure is 
effectuated by the computer program BLAST 2 SEQUENCES, 
available online at 

35 http : //www . ncbi . nlm . nih . gov/blast /bl2seq/bl2 . html , 

82 



WO 03/037931 



PCT/US02/35129 



To assess percent identity of amino acid sequences, the 
BLASTP module of BLAST 2 SEQUENCES is used with default 
values of (i) BLOSUM62 matrix, Henikoff et al . , Proc. 
5 Natl. Acad. Sci USA 89 (22) : 10915-9 (1992); (ii) open gap 
11 and extension gap 1 penalties; and (iii) gap x_dropoff 
50 expect 10 word size 3 filter, and both sequences are 
entered in their entireties. 

As is well known, amino acid substitutions 

10 occur frequently among natural allelic variants, with 
conservative substitutions often occasioning only de 
minimis change in protein function. 

Accordingly, it is an aspect of the present 
invention to provide proteins not only identical in 

15 sequence to those described with particularity herein, 

but also to provide isolated proteins having the sequence 
of AMLP1 proteins, or portions thereof, with conservative 
amino acid substitutions. It is a further aspect to 
provide isolated proteins having the sequence of AMLPl 

20 proteins, and portions thereof, with moderately 
conservative amino acid substitutions. These 
conservatively-substituted and moderately conservatively- 
substituted variants can be naturally occurring or can 
result from human intervention. 

25 Although there are a variety of metrics for 

calling conservative amino acid substitutions, based 
primarily on either observed changes among evolutionarily 
related proteins or on predicted chemical similarity, for 
purposes herein a conservative replacement is any change 

30 having a positive value in the PAM250 log-likelihood 
matrix reproduced herein below (see Gonnet et al., 
Science 256 (5062) : 1443-5 (1992)): 
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20 

For purposes herein, a "moderately conservative" 
replacement is any change having a nonnegative value in 
the PAM250 log-likelihood matrix reproduced herein above. 

As is also well known in the art, relatedness 

25 of proteins can also be characterized using a functional 
test, the ability of the encoding nucleic acids to base- 
pair to one another at defined hybridization 
stringencies . 

It is, therefore, another aspect of the 

30 invention to provide isolated proteins not only identical 
in sequence to those described with particularity herein, 
but also to provide isolated proteins ("hybridization 
related proteins") that are encoded by nucleic acids that 
hybridize under high stringency conditions (as defined 

35 herein above) to all or to a portion of various of the 
isolated nucleic acids of the present invention 
("reference nucleic acids") . It is a further aspect of 
the invention to provide isolated proteins 
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("hybridization related proteins") that are encoded by 
nucleic acids that hybridize under moderate stringency 
conditions (as defined herein above) to all or to a 
portion of various of the isolated nucleic acids of the 
5 present invention ("reference nucleic acids"). 

The hybridization related proteins can be 
alternative isoforms, homologues, paralogues, and 
orthologues of the AMLP1 protein of the present 
invention. Particularly useful orthologues are those 

10 from other primate species, such as chimpanzee, rhesus 
macaque monkey, baboon, orangutan, and gorilla, from 
rodents, such as rats, mice, guinea pigs; from 
lagomorphs, such as rabbits, and from domestic livestock, 
such as cow, pig, sheep, horse, and goat. 

15 Relatedness of proteins can also be 

characterized using a second functional test, the ability 
of a first protein competitively to inhibit the binding 
of a second protein to an antibody. 

It is, therefore, another aspect of the present 

20 invention to provide isolated proteins not only identical 
in sequence to those described with particularity herein, 
but also to provide isolated proteins ("cross-reactive 
proteins") that competitively inhibit the binding of 
antibodies to all or to a portion of various of the 

25 isolated AMLP1 proteins of the present invention 

("reference proteins"). Such competitive inhibition can 
readily be determined using immunoassays well known in 
the art. 

Among the proteins of the present invention 
30 that differ in amino acid sequence from those described 
with particularity herein — including those that have 
deletions and insertions causing up to 10% non- identity, 
those having conservative or moderately conservative 
substitutions, hybridization related proteins, and cross- 
35 reactive proteins - those that substantially retain one 
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or more AMLPl activities are particularly useful. As 
described above, AMLPl plays a role similar to that of 
angiomotin as an adaptor protein that interacts with both 
angiostatin-like protein and components of the actin 
5 cytoskeleton and has anti-angiogenesis activity. 

Residues that are tolerant of change while 
retaining function can be identified by altering the 
protein at known residues using methods known in the art, 
such as alanine scanning mutagenesis, Cunningham et al . , 

10 Science 244 (4908) :1081-5 (1989); transposon linker 

scanning mutagenesis, Chen et al . , Gene 263(1-2) : 39-48 
(2001) ; combinations of homolog- and alanine -scanning 
mutagenesis, Jin et al., J. Mol . Biol. 226 (3) :851-65 
(1992); combinatorial alanine scanning, Weiss et al . , 

15 Proc. Natl. Acad. Sci USA 97 (16) : 8950-4 (2000), followed 
by functional assay. Transposon linker scanning kits are 
available commercially (New England Biolabs, Beverly, MA, 
USA, catalog, no. E7-102S; EZ::TN™ In-Frame Linker 
Insertion Kit, catalogue no. EZI04KN, Epicentre 

20 Technologies Corporation, Madison, WI, USA) . 

As further described below, the isolated 
proteins of the present invention can readily be used as 
specific immunogens to raise antibodies that specifically 
recognize AMLPl proteins, their isoforms, homologues, 

25 paralogues, and/or orthologues. The antibodies, in turn, 
can be used, inter alia, specifically to assay for the 
AMLPl proteins of the present invention - e.g. by ELISA 
for detection of protein fluid samples, such as serum, by 
immunohistochemistry or laser scanning cytometry, for 

30 detection of protein in tissue samples, or by flow 

cytometry, for detection of intracellular protein in cell 
suspensions — for specific antibody-mediated isolation 
and/or purification of AMLPl proteins, as for example by 
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immunoprecipitation, and for use as specific agonists or 
antagonists of AMLP1 action. 

The isolated proteins of the present invention 
are also immediately available for use as specific 
5 standards in assays used to determine the concentration 
and/or amount specifically of the AMLP1 proteins of the 
present invention. As is well known, ELISA kits for 
detection and quantitation of protein analytes typically 
include isolated and purified protein of known 

10 concentration for use as a measurement standard (e.g., 
the human interferon-y OptEIA kit, catalog no. 555142, 
Pharmingen, San Diego, CA, USA includes human recombinant 
gamma interferon, baculovirus produced) . 

The isolated proteins of the present invention 

15 are also immediately available for use as specific 

biomolecule capture probes for surf ace -enhanced laser 
desorption ionization (SELDI) detection of protein- 
protein interactions, WO 98/59362; WO 98/59360; WO 
98/59361; and Merchant et-al., Electrophoresis 

20 21(6) :1164-77 (2000), the disclosures of which are 

incorporated herein by reference in their entireties. 
Analogously, the isolated proteins of the present 
invention are also immediately available for use as 
specific biomolecule capture probes on BIACORE surface 

25 plasmon resonance probes. . See Weinberger et al., 

Pharmacogenomics 1 (4) : 395-416 (2000); Malmqvist, Biochem. 
Soc. Trans. 27(2):335-40 (1999). 

The isolated proteins of the present invention 
are also useful as a therapeutic supplement in patients 

30 having a specific deficiency in AMLP1 production. 

In another aspect, the invention also provides 
fragments of various of the proteins of the present 
invention. The protein fragments are useful, inter alia, 
as antigenic and immunogenic fragments of AMLP1 . 
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By "fragments" of a protein is here intended 
isolated proteins (equally, polypeptides, peptides, 
oligopeptides) , however obtained, that have an amino acid 
sequence identical to a portion of the reference amino 
5 acid sequence, which portion is at least 6 amino acids 
and less than the entirety of the reference nucleic acid. 

As so defined, "fragments" need not be obtained by 
physical fragmentation of the reference protein, although 
such provenance is not thereby precluded. 

10 Fragments of at least 6 contiguous amino acids 

are useful in mapping B cell and T cell epitopes of the 
reference protein. See, e.g., Geysen et al . , "Use of 
peptide synthesis to probe viral antigens for epitopes to 
a resolution of a single amino acid," Proc. Natl. Acad. 

15 Sci. USA 81:3998-4002 (1984) and U.S. Pat. Nos. 4,708,871 
and 5,595,915, the disclosures of which are incorporated 
herein by reference in their entireties. Because the 
fragment need not itself be immunogenic, part of an 
immunodominant epitope, nor even recognized by native 

20 antibody, to be useful in such epitope mapping, all 

fragments of at least 6 amino acids of the proteins of 
the present invention have utility in such a study. 

Fragments of at least 8 contiguous amino acids, 
often at least 15 contiguous amino acids, have utility as 

25 immunogens for raising antibodies that recognize the 
proteins of the present invention. See, e.g., Lerner, 
"Tapping the immunological repertoire to produce 
antibodies of predetermined specificity," Nature 299:592- 
596 (1982); Shinnick et al., "Synthetic peptide 

30 immunogens as vaccines," Annu. Rev. Microbiol. 37:425-46 
(1983); Sutcliffe et al . , "Antibodies that react with 
predetermined sites on proteins," Science 219:660-6 
(1983) , the disclosures of which are incorporated herein 
by reference in their entireties. As further described 

35 in the above-cited references, virtually all 8-mers, 
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conjugated to a carrier, such as a protein, prove 
immunogenic - that is, prove capable of eliciting 
antibody for the conjugated peptide; accordingly, all 
fragments of at least 8 amino acids of the proteins of 
5 the present invention have utility as immunogens. 

Fragments of at least 8, 9, 10 or 12 contiguous 
amino acids are also useful as competitive inhibitors of 
binding of the entire protein, or a portion thereof, to 
antibodies (as in epitope mapping) , and to natural 

10 binding partners, such as subunits in a multimeric 
complex or to receptors or ligands of the subject 
protein; this competitive inhibition permits 
identification and separation of molecules that bind 
specifically to the protein of interest, U.S. Pat. Nos. 

15 5,539,084 and 5,783,674, incorporated herein by reference 
in their entireties. 

The protein, or protein fragment, of the 
present invention is thus at least 6 amino acids in 
length, typically at least 8, 9, 10 or 12 amino acids in 

20 length, and often at least 15 amino acids in length. 

Often, the protein or the present invention, or fragment 
thereof, is at least 20 amino acids in length, even 25 
amino acids, 30 amino acids, 35 amino acids, or 50 amino 
acids or more in length. Of course, larger fragments 

25 having at least 75 amino acids, 100 amino acids, or even 
150 amino acids are also useful, and at times preferred. 

The present invention further provides fusions 
of each of the proteins and protein fragments of the 
present invention to heterologous polypeptides. 

30 By fusion is here intended that the protein or 

protein fragment of the present invention is linearly 
contiguous to the heterologous polypeptide in a peptide- 
bonded polymer of amino acids or amino acid analogues; by 
"heterologous polypeptide" is here intended a polypeptide 

35 that does not naturally occur in contiguity with the 
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protein or protein fragment of the present invention. As 
so defined, the fusion can consist entirely of a 
plurality of fragments of the AMLP1 protein in altered 
arrangement; in such case, any of the AMLPl fragments can 
5 be considered heterologous to the other AMLPl fragments 
in the fusion protein. More typically, however, the 
heterologous polypeptide is not drawn from the AMLPl 
protein itself. 

The fusion proteins of the present invention 

10 will include at least one fragment of the protein of the 
present invention, which fragment is at least 6, 
typically at least 8, often at least 15, and usefully at 
least 16, 17, 18, 19, or 20 amino acids long. The 
fragment of the protein of the present to be included in 

15 the fusion can usefully be at least 25 amino acids long, 
at least 50 amino acids long, and can be at least 75, 
100, or even 150 amino acids long. Fusions that include 
the entirety of the proteins of the present invention 
have particular utility. 

20 The heterologous polypeptide included within 

the fusion protein of the present invention is at least 6 
amino acids in length, often at least 8 amino acids in 
length, and usefully at least 15, 20, and 25 amino acids 
in length. Fusions that include larger polypeptides, 

25 such as the IgG Fc region, and even entire proteins (such 
as GFP chromophore -containing proteins) , have particular 
utility. 

As described above in the description of 
vectors and expression vectors of the present invention, 

30 which discussion is incorporated herein by reference in 
its entirety, heterologous polypeptides to be included in 
the fusion proteins of the present invention can usefully 
include those designed to facilitate purification and/or 
visualization of recombinantly-expressed proteins. 

35 Although purification tags can also be incorporated into 
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fusions that are chemically synthesized, chemical 
synthesis typically provides sufficient purity that 
further purification by HPLC suffices; however, 
visualization tags as above described retain their 
5 utility even when the protein is produced by chemical 
synthesis, and when so included render the fusion 
proteins of the present invention useful as directly 
detectable markers of AMLP1 presence. 

As also discussed above, heterologous 

10 polypeptides to be included in the fusion proteins of the 
present invention can usefully include those that 
facilitate secretion of recombinantly expressed 
proteins - into the periplasmic space or extracellular 
milieu for prokaryotic hosts, into the culture medium for 

15 eukaryotic cells — through incorporation of secretion 
signals and/or leader sequences. 

Other useful protein fusions of the present 
invention include those that permit use of the protein of 
the present invention as bait in a yeast two-hybrid 

20 system. See Bartel et al . (eds.), The Yeast Two -Hybrid 
System , Oxford University Press (1997) (ISBN: 
0195109384); Zhu et al . , Yeast Hybrid Technologies , Eaton 
Publishing, (2000) (ISBN 1-881299-15-5); Fields et al . , 
Trends Genet. 10(8): 286-92 (1994); Mendelsohn et al . , 

25 Curr. Opin. Biotechnol . 5(5):482-6 (1994); Luban et al . , 
Curr. Opin. Biotechnol. 6(l):59-64 (1995); Allen et al . , 
Trends Biochem. Sci. 20(12) :511-6 (1995); Drees, Curr. 
Opin. Chem. Biol. 3(1): 64-70 (1999); Topcu et al . , Pharm. 
Res. 17 (9) :1049-55 (2000); Fashena et al . , Gene 

30 250(1-2) :1-14 (2000), the disclosures of which are 

incorporated herein by reference in their entireties. 
Typically, such fusion is to either E. coli LexA or yeast 
GAL4 DNA binding domains. Related bait plasmids are 
available that express the bait fused to a nuclear 

35 localization signal. 
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Other useful protein fusions include those that 
permit display of the encoded protein on the surface of a 
phage or cell, fusions to intrinsically fluorescent 
proteins, such as green fluorescent protein (GFP) , and 
5 fusions to the IgG Fc region, as described above, which 
discussion is incorporated here by reference in its 
entirety. 

The proteins and protein fragments of the 
present invention can also usefully be fused to protein 

10 toxins, such as Pseudomonas exotoxin A, diphtheria toxin, 
Shiga toxin A, anthrax toxin lethal factor, ricin, in 
order to effect ablation of cells that bind or take up 
the proteins of the present invention. 

The isolated proteins, protein fragments, and 

15 protein fusions of the present invention can be composed 
of natural amino acids linked by native peptide bonds, or 
can contain any or all of nonnatural amino acid 
analogues, nonnative bonds, and post -synthetic (post 
translational) modifications, either throughout the 

20 length of the protein or localized to one or more 
portions thereof. 

As is well known in the art, when the isolated 
protein is used, e.g., for epitope mapping, the range of 
such nonnatural analogues, nonnative inter- residue bonds, 

25 or post -synthesis modifications will be limited to those 
that permit binding of the peptide to antibodies. When 
used as an immunogen for the preparation of antibodies in 
a non-human host, such as a mouse, the range of such 
nonnatural analogues, nonnative inter-residue bonds, or 

30 post-synthesis modifications will be limited to those 
that do not interfere with the immunogenicity of the 
protein. When the isolated protein is used as a 
therapeutic agent, such as a vaccine or for replacement 
therapy, the range of such changes will be limited to 
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those that do not confer toxicity upon the isolated 
protein. 

Non-natural amino acids can be incorporated 
during solid phase chemical synthesis or by recombinant 
5 techniques, although the former is typically more common. 

Solid phase chemical synthesis of peptides is * 
well established in the art. Procedures are described, 
inter alia, in Chan et al. (eds.), Fmoc Solid Phase 
Peptide Synthesis: A Practical Approach (Practical 

10 Approach Series) , Oxford Univ. Press (March 2000) (ISBN: 
0199637245) ; Jones, Amino Acid and Peptide Synthesis 
(Oxford Chemistry Primers, No 7) , Oxford Univ. Press 
(August 1992) (ISBN: 0198556683); and Bodanszky, 
Principles of Peptide Synthesis (Springer Laboratory) , 

15 Springer Verlag (December 1993) (ISBN: 0387564314), the 
disclosures of which are incorporated herein by reference 
in their entireties. 

For example, D-enantiomers of natural amino 
acids can readily be incorporated during chemical peptide 

20 synthesis: peptides assembled from D-amino acids are more 
resistant to proteolytic attack; incorporation of D- 
enantiomers can also be used to confer specific three 
dimensional conformations on the peptide. Other amino 
acid analogues commonly added during chemical synthesis 

25 include ornithine, norleucine, phosphorylated amino acids 
(typically phosphoserine, phosphothreonine , 
phosphotyrosine) , L-malonyltyrosine, a non-hydrolyzable 
analog of phosphotyrosine (Kole et al . , Biochem. Biophys. 
Res. Com. 209:817-821 (1995)), and various halogenated 

30 phenylalanine derivatives. 

Amino acid analogues having detectable labels 
are also usefully incorporated during synthesis to 
provide a labeled polypeptide. 

Biotin, for example (indirectly detectable 

35 through interaction with avidin, streptavidin, 
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neutravidin, captavidin, or anti-biotin antibody) , can be 
added using biotinoyl-- (9-f luorenylmethoxycarbonyl) -L- 
lysine (FMOC biocytin) (Molecular Probes, Eugene, OR, 
USA) . (Biotin can also be added enzymatically by 
5 incorporation into a fusion protein of a E. coli BirA 
substrate peptide.) 

The FMOC and tBOC derivatives of 
dabcyl -L-lysine (Molecular Probes, Inc., Eugene, OR, USA) 
can be used to incorporate the dabcyl chromophore at 

10 selected sites in the peptide sequence during synthesis. 
The aminonaphthalene derivative EDANS, the most common 
fluorophore for pairing with the dabcyl quencher in 
fluorescence resonance energy transfer (FRET) systems, 
can be introduced during automated synthesis of peptides 

15 by using EDANS- -FMOC-L-glutamic acid or the corresponding 
tBOC derivative (both from Molecular Probes, Inc., 
Eugene, OR, USA) . Tetramethylrhodamine fluorophores can 
be incorporated during automated FMOC synthesis of 
peptides using (FMOC) --TMR-L- lysine (Molecular Probes, 

20 Inc. Eugene, OR, USA). 

Other useful amino acid analogues that can be 
incorporated during chemical synthesis include aspartic 
acid, glutamic acid, lysine, and tyrosine analogues 
having allyl side-chain protection (Applied Biosystems, 

25 Inc., Foster City, CA, USA) ; the allyl side chain permits 
synthesis of cyclic, branched- chain, sulfonated, 
glycosylated, and phosphorylated peptides. 

A large number of other FMOC-protected non- 
natural amino acid analogues capable of incorporation 

30 during chemical synthesis are available commercially, 
including, e.g. , Fmoc-2-aminobicyclo [2.2.1] heptane-2- 
carboxylic acid, Fmoc-3-endo-aminobicyclo [2.2.1] heptane- 
2-endo-carboxylic acid, Fmoc-3-exo- 

aminobicyclo [2.2.1] heptane-2-exo-carboxylic acid, Fmoc-3- 
35 endo-amino-bicyclo [2.2.1] hept -5 -ene- 2-endo-carboxylic 
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acid, Fmoc-3-exo-amino-bicyclo [2.2.1] hept-5-ene-2-exo- 
carboxylic acid, Fmoc-cis-2 -amino- 1-cyclohexanecarboxylic 
acid , Fmoc - trans - 2 - amino - 1 - cyclohexanecarboxyl ic ac id , 
Fmoc - 1 - amino - 1 - eye lopen t anecarboxy 1 ic acid, Fmoc - c i s - 2 - 
5 amino- 1-cyclopentanecarboxylic acid, Fmoc -1 -amino- 1- 
cyclopropanecarboxylic acid, Fmoc-D-2-amino-4- 
(ethylthio) butyric acid, Fmoc-L-2-amino-4- 
(ethylthio) butyric acid, Fmoc-L-buthionine, Fmoc-S- 
methyl -L- Cysteine, Fmoc-2-aminobenzoic acid (anthranillic 

10 acid) , Fmoc-3-aminobenzoic acid, Fmoc-4-aminobenzoic 

acid, Fmoc-2-aminobenzophenone-2 1 -carboxylic acid, Fmoc- 
N- (4-aminobenzoyl) -b-alanine, Fmoc -2 -amino -4 , 5- 
dimethoxybenzoic acid, Fmoc-4-aminohippuric acid, Fmoc-2- 
amino-3-hydroxybenzoic acid, Fmoc -2 -amino- 5- 

15 hydroxybenzoic acid, Fmoc-3-amino-4-hydroxybenzoic acid, 
Fmoc - 4 - amino - 3 - hydroxybenzo ic ac id , Fmoc - 4 - amino - 2 - 
hydroxybenzoic acid, Fmoc-5-amino-2-hydroxybenzoic acid, 
Fmoc - 2 - amino- 3 -me thoxybenzoic acid , Fmoc - 4 - amino - 3 - 
met hoxyben zoic acid, Fmoc-2-amino-3-methylbenzoic acid, 

20 Fmoc-2-amino-5-methylbenzoic acid, Fmoc- 2 -amino- 6- 

methylbenzoic acid, Fmoc-3-amino-2-methylbenzoic acid, 
Fmoc-3-amino-4-methylbenzoic acid, Fmoc-4-amino-3- 
methylbenzoic acid, Fmoc-3-amino-2-naphtoic acid, Fmoc- 
D , L- 3 -amino- 3 -phenylpropionic acid , Fmoc - L - Methyldopa , 

25 Fmoc-2-amino-4 , 6-dimethyl-3-pyridinecarboxylic acid, 
Fmoc-D,L-?-amino-2-thiophenacetic acid, Fmoc-4- 
( carboxyme thyl ) piperaz ine , Fmoc - 4 - carboxypiperaz ine , 
Fmoc - 4 - ( carboxyme thyl ) homopiperaz ine , Fmoc - 4 -phenyl - 4 - 
piperidinecarboxylic acid, Fmoc-L-1 , 2 , 3 , 4 - 

30 tetrahydronorharman- 3 -carboxylic acid, Fmoc-L- 

thiazolidine-4 -carboxylic acid, all available from The 
Peptide Laboratory (Richmond, CA, USA) . 

Non-natural residues can also be added 
biosynthetically by engineering a suppressor tRNA, 

35 typically one that recognizes the UAG stop codon, by 
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chemical aminoacylation with the desired unnatural amino 
acid and. Conventional site-directed mutagenesis is used 
to introduce the chosen stop codon UAG at the site of 
interest in the protein gene . When the acylated 
5 suppressor tRNA and the mutant gene are combined in an in 
vitro transcription/translation system, the unnatural 
amino acid is incorporated in response to the UAG codon 
to give a protein containing that amino acid at the 
specified position. Liu et al . , Proc. Natl Acad. Sci. 
10 USA 96(9):4780-5 (1999); Wang et al . , Science 
292(5516) :498-500 (2001) . 

The isolated proteins, protein fragments and 
fusion proteins of the present invention can also include 
nonnative inter-residue bonds, including bonds that lead 
15 to circular and branched forms. 

The isolated proteins and protein fragments of 
the present invention can also include post-translational 
and post -synthetic modifications, either throughout the 
length of the protein or localized to one or more 
20 portions thereof. 

For example, when produced by recombinant 
expression in eukaryotic cells, the isolated proteins, 
fragments, and fusion proteins of the present invention 
will typically include N-linked and/or O-linked 
25 glycosylation, the pattern of which will reflect both the 
availability of glycosylation sites on the protein 
sequence and the identity of the host cell. Further 
modification of glycosylation pattern can be performed 
enzymatically . 

30 As another example, recombinant polypeptides of 

the invention may also include an initial modified 
methionine residue, in some cases resulting from host- 
mediated processes. 

When the proteins, protein fragments, and 

35 protein fusions of the present invention are produced by 
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chemical synthesis, post-synthetic modification can be 
performed before deprotection and cleavage from the resin 
or after deprotection and cleavage. Modification before 
deprotection and cleavage of the synthesized protein 
5 often allows greater control, e.g. by allowing targeting 
of the modifying moiety to the N-terminus of a 
resin-bound synthetic peptide. 

Useful post -synthetic (and post-translational) 
modifications include conjugation to detectable labels, 

10 such as f luorophores . 

A wide variety of amine-reactive and thiol - 
reactive fluorophore derivatives have been synthesized 
that react under nondenaturing conditions with N-terminal 
amino groups and epsilon amino groups of lysine residues, 

15 on the one hand, and with free thiol groups of cysteine 
residues, on the other. 

Kits are available commercially that permit 
conjugation of proteins to a variety of amine-reactive or 
thiol-reactive f luorophores : Molecular Probes, Inc. 

20 (Eugene, OR, USA), e.g., offers kits for conjugating 
proteins to Alexa Fluor 350, Alexa Fluor 430, 
Fluorescein-EX, Alexa Fluor 488, Oregon Green 488, Alexa 
Fluor 532, Alexa Fluor 546, Alexa Fluor 546, Alexa Fluor 
568, Alexa Fluor 594, and Texas Red-X. 

25 A wide variety of other amine-reactive and 

thiol-reactive fluoirophores are available commercially 
(Molecular Probes, Inc., Eugene, OR, USA), including 
Alexa Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 532, 
Alexa Fluor® 54 6, Alexa Fluor® 568, Alexa Fluor® 594, 

30 Alexa Fluor® 647 (monoclonal antibody labeling kits 

available from Molecular Probes, Inc., Eugene, OR, USA), 
BODIPY dyes, such as BODIPY 493/503, BODIPY FL, BODIPY 
R6G, BODIPY 530/550, BODIPY TMR, BODIPY 558/568, BODIPY 
558/568, BODIPY 564/570, BODIPY 576/589, BODIPY 581/591, 

35 BODIPY TR, BODIPY 630/650, BODIPY 650/665, Cascade Blue, 
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Cascade Yellow, Dansyl, lissamine rhodamdne B, Marina 
Blue, Oregon Green 4 88, Oregon Green 514, Pacific Blue, 
rhodamine 6G, rhodamine green, rhodamine red, 
tetramethylrhodamine, Texas Red (available from Molecular 
5 Probes, Inc., Eugene, OR, USA). 

The polypeptides of the present invention can 
also be conjugated to f luorophores , other proteins, and 
other macromolecules, using bifunctional linking 
reagents. 

10 Common homobi functional reagents include, e.g., 

APG, AEDP, BASED, BMB, BMDB, BMH, BMOE, BM[PEO]3, 
BM[PEO]4, BS3 , BSOCOES, DFDNB, DMA, DMP, DMS, DPDPB, DSG, 
DSP (Lomant's Reagent), DSS, DST, DTBP, DTME, DTSSP, EGS, 
HBVS, Sulfo-BSOCOES, Sulfo-DST, Sulfo-EGS (all available 

15 from Pierce, Rockford, IL, USA) ; common 

heterobifunctional cross- linkers include ABH, AMAS, 
ANB-NOS, APDP, ASBA, BMPA, BMPH, BMPS, EDC, EMCA, EMCH, 
EMCS, KMUA, KMUH, GMBS, LC-SMCC, LC-SPDP, MBS, M2C2H, 
MPBH, MSA, NHS -ASA, PDPH, PMPI, SADP, SAED, SAND, SANPAH, 

20 SASD, SATP, SBAP, SFAD, SIA, SIAB, SMCC, SMPB, SMPH, 
SMPT, SPDP, Sulfo-EMCS, Sulfo-GMBS, Sulfo-HSAB, 
Sulfo-KMUS, Sulfo-LC-SPDP, Sulfo-MBS, Sulf o-NHS-LC-ASA, 
Sulfo-SADP, Sulfo -SANPAH, Sulfo-SIAB, Sulfo-SMCC, 
Sulfo-SMPB, Sulfo-LC-SMPT, SVSB, TFCS (all available 

25 Pierce, Rockford, IL, USA) . 

The proteins, protein fragments, and protein 
fusions of the present invention can be conjugated, using 
such cross -linking reagents, to f luorophores that are not 
amine- or thiol -reactive . 

30 Other labels that usefully can be conjugated to 

the proteins, protein fragments, and fusion proteins of 
the present invention include radioactive labels, 
echosonographic contrast reagents, and MR I contrast 
agents . 
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The proteins, protein fragments, and protein 
fusions of the present invention can also usefully be 
conjugated using cross-linking agents to carrier 
proteins, such as KLH, bovine thyroglobulin, and even 
5 bovine serum albumin (BSA) , to increase immunogenicity 
for raising anti-AMLPl antibodies. 

The proteins, protein fragments, and protein 
fusions of the present invention can also usefully be 
conjugated to polyethylene glycol (PEG) ; PEGylation 

10 increases the serum half life of proteins administered 
intravenously for replacement therapy. Delgado et al . , 
Crit. Rev. Ther. Drug Carrier Syst. 9 (3-4) :249-304 
(1992); Scott et al . , Curr. Pharm. Des. 4(6):423-38 
(1998); DeSantis et al . , Curr. Opin. Biotechnol . 

15 10(4):324-30 (1999), incorporated herein by reference in 
their entireties. PEG monomers can be attached to the 
protein directly or through a linker, with PEGylation 
using PEG monomers activated with tresyl chloride 
(2 , 2 , 2- trif luoroethanesulphonyl chloride) permitting 

20 direct attachment under mild conditions. 

The isolated proteins of the present invention, 
including fusions thereof, can be produced by recombinant 
expression, typically using the expression vectors of the 
present invention as above-described or, if fewer than 

25 about 100 amino acids, by chemical synthesis (typically, 
solid phase synthesis) , and, on occasion, by in vitro 
translation. 

Production of the isolated proteins of the 
present invention can optionally be followed by 

30 purification. 

Purification of recombinantly expressed proteins is now 
well within the skill in the art. See, e.g., Thorner et 
al. (eds.), Applications of Chimeric Genes and Hybrid 
Proteins, Part A: Gene Expression and Protein 

35 Purification (Methods in Enzymology, Volume 326) , 
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Academic Press (2000), (ISBN: 0121822273); Harbin (ed.), 
Cloning, Gene Expression and Protein Purification : 
Experimental Procedures and Process Rationale , Oxford 
Univ. Press (2001) (ISBN: 0195132947); Marshak et al . , 
5 Strategies for Protein Purification and Characterization: 
A Laboratory Course Manual / Cold Spring Harbor Laboratory 
Press (1996) (ISBN: 0-87969-385-1); and Roe (ed .) , 
Protein Purification Applications , Oxford University 
Press (2001) , the disclosures of which are incorporated 

10 herein by reference in their entireties, and thus need 
not be detailed here. 

Briefly, however, if purification tags have 
been fused through use of an expression vector that 
appends such tag, purification can be effected, at least 

15 in part, by means appropriate to the tag, such as use of 
immobilized metal affinity chromatography for 
polyhistidine tags. Other techniques common in the art 
include ammonium sulfate fractionation, 

immunoprecipitation, fast protein liquid chromatography 

20 (FPLC) , high performance liquid chromatography (HPLC) , 
and preparative gel electrophoresis. 

Purification of chemically-synthesized peptides 
can readily be effected, e.g., by HPLC. 

Accordingly, it is an aspect of the present 

25 invention to provide the isolated proteins of the present 
invention in pure or substantially pure form. 

A purified protein of the present invention is 
an isolated protein, as above described, that is present 
at a concentration of at least 95%, as measured on a 

30 weight basis (w/w) with respect to total protein in a 

composition. Such purities can often be obtained during 
chemical synthesis without further purification, as, 
e.g., by HPLC. Purified proteins of the present 
invention can be present at a concentration (measured on 

35 a weight basis with respect to total protein in a 
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composition) of 96%, 97%, 98%, and even 99%. The 
proteins of the present invention can even be present at 
levels of 99.5%, 99.6%, and even 99.7%, 99.8%, or even 
99.9% following purification, as by HPLC. 

Although high levels of purity are particularly 
useful when the isolated proteins of the present 
invention are used as therapeutic agents - such as 
vaccines, or for replacement therapy - the isolated 
proteins of the present invention are also useful at 
lower purity. For example, partially purified proteins 
of the present invention can be used as immunogens to 
raise antibodies in laboratory animals. 

Thus, in another aspect, the present invention 
provides the isolated proteins of the present invention 
in substantially purified form. A "substantially 
purified protein" of the present invention is an isolated 
protein, as above described, present at a concentration 
of at least 70%, measured on a weight basis with respect 
to total protein in a composition. Usefully, the 
substantially purified protein is present at a 
concentration, measured on a weight basis with respect to 
total protein in a composition, of at least 75%, 80%, or 
even at least 85%, 90%, 91%, 92%, 93%, 94%, 94.5% or even 
at least 94.9%. 

In preferred embodiments, the purified and 
substantially purified proteins of the present invention 
are in compositions that lack detectable ampholytes, 
acrylamide monomers, bis-acrylamide monomers, and 
polyacrylamide . 

The proteins, fragments, and fusions of the 
present invention can usefully be attached to a 
substrate. The substrate can porous or solid, planar or 
non-planar; the bond can be covalent or noncovalent. 

For example, the proteins, fragments, and 
fusions of the present invention can usefully be bound to 
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a porous substrate, commonly a membrane, typically 
comprising nitrocellulose, polyvinyl idene fluoride 
(PVDF) , or cationically derivatized, hydrophilic PVDF; so 
bound, the proteins, fragments, and fusions of the 
5 present invention can be used to detect and quantify 

antibodies, e.g. in serum, that bind specifically to the 
immobilized protein of the present invention. 

As another example, the proteins, fragments, 
and fusions of the present invention can usefully be 

10 bound to a substantially nonporous substrate, such as 
plastic, to detect and quantify antibodies, e.g. in 
serum, that bind specifically to the immobilized protein 
of the present invention. Such plastics include 
polymethylacrylic , polyethylene , polypropylene , 

15 polyacrylate , polymethylmethacrylate , polyvinylchloride , 
polytetraf luoroethylene, polystyrene, polycarbonate, 
polyacetal , polysulf one , celluloseacetate , 
cellulosenitrate, nitrocellulose, or mixtures thereof; 
when the assay is performed in standard microtiter dish, 

20 the plastic is typically polystyrene. 

The proteins, fragments, and fusions of the 
present invention can also be attached to a substrate 
suitable for use as a surface enhanced laser desorption 
ionization source; so attached, the protein, fragment, or 

25 fusion of the present invention is useful for binding and 
then detecting secondary proteins that bind with 
sufficient affinity or avidity to the surf ace -bound 
protein to indicate biologic interaction therebetween. 
The proteins, fragments, and fusions of the present 

30 invention can also be attached to a substrate suitable 
for use in surface plasmon resonance detection; so 
attached, the protein, fragment, or fusion of the present 
invention is useful for binding and then detecting 
secondary proteins that bind with sufficient affinity or 
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avidity to the surface-bound protein to indicate 
biological interaction therebetween. 



Human AMLP1 Proteins 

5 

In a first series of protein embodiments, the 
invention provides an isolated AMLP1 polypeptide having 
the amino acid sequence in SEQ ID NO: 3 and SEQ ID NO: 6, 
which are full length human AMLPla and AMLPlb proteins, 

10 respectively. When used as immunogens, the full length 
proteins of the present invention can be used, inter 
alia, to elicit antibodies that bind to a variety of 
epitopes of the AMLP1 protein. 

The invention further provides fragments of the 

15 above -described polypeptides, particularly fragments 

having at least 6 amino acids, typically at least 8 amino 
acids, often at least 15 amino acids, and even the 
entirety of the sequence given in SEQ ID NO: 3 and SEQ ID 
NO: 6. 

20 The invention further provides fragments of at 

least 6 amino acids, typically at least 8 amino acids, 
often at least 15 amino acids, and even the entirety of 
the sequence given in SEQ ID NO: 8. 

As described above, the invention further 

25 provides proteins that differ in sequence from those 

described with particularity in the above -referenced SEQ 
ID NOs., whether by way of insertion or deletion, by way 
of conservative or moderately conservative substitutions, 
as hybridization related proteins, or as cross - 

30 hybridizing proteins, with those that substantially 
retain an AMLP1 activity particularly useful. 

The invention further provides fusions of the 
proteins and protein fragments herein described to 
heterologous polypeptides. 

35 
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ANTIBODIES AND ANT I BODY -PRODUCING CELLS 



In another aspect, the invention provides 
antibodies, including fragments and derivatives thereof, 
5 that bind specifically to AMLP1 proteins and protein 

fragments of the present invention or to one or more of 
the proteins and protein fragments encoded by the 
isolated AMLP1 nucleic acids of the present invention. 
The antibodies of the present invention can be specific 

10 for all of linear epitopes, discontinuous epitopes, or 
conformational epitopes of such proteins or protein 
fragments, either as present on the protein in its native 
conformation or, in some cases, as present on the 
proteins as denatured, as, e.g., by solubilization in 

15 SDS. 

In other embodiments, the invention provides 
antibodies, including fragments and derivatives thereof, 
the binding of which can be competitively inhibited by 
one or more of the AMLP1 proteins and protein fragments 

20 of the present invention, or by one or more of the 

proteins and protein fragments encoded by the isolated 
AMLP1 nucleic acids of the present invention. 

As used herein, the term "antibody" refers to a 
polypeptide, at least a portion of which is encoded by at 

25 least one immunoglobulin gene, which can bind 

specifically to a first molecular species, and to 
fragments or derivatives thereof that remain capable of 
such specific binding. 

By "bind specifically" and "specific binding" 

30 is here intended the ability of the antibody to bind to a 
first molecular species in preference to binding to other 
molecular species with which the antibody and first 
molecular species are admixed. An antibody is said 
specifically to "recognize" a first molecular species 
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when it can bind specifically to that first molecular 
species . 

As is well known in the art, the degree to 
which an antibody can discriminate as among molecular 

5 species in a mixture will depend, in part, upon the 

conformational relatedness of the species in the mixture; 
typically, the antibodies of the present invention will 
discriminate over adventitious binding to non-AMLPl 
proteins by at least two- fold, more typically by at least 

10 5-fold, typically by more than 10-fold, 25-fold, 50-fold, 
75-fold, and often by more than 100-fold, and on occasion 
by more than 500 -fold or 1000 -fold. When used to detect 
the proteins or protein fragments of the present 
invention, the antibody of the present invention is 

15 sufficiently specific when it can be used to determine 
the presence of the protein of the present invention in 
samples derived from human brain, liver, kidney, and 
adrenal gland as well as prostate, testis, lung, 
placenta, skeletal muscle, heart, bone marrow and colon 

20 tumor . 

Typically, the affinity or avidity of an 
antibody (or antibody mul timer, as in the case of an IgM 
pentamer) of the present invention for a protein or 
protein fragment of the present invention will be at 

25 least about 1 x 10" 6 molar (M) , typically at least about 
5 x 10" 7 M, usefully at least about 1 x 10* 7 M, with 
affinities and avidities of at least 1 x 10" 8 M, 
5 x 10" 9 M, and 1 x 10" 10 M proving especially useful. 

The antibodies of the present invention can be 

30 naturally-occurring forms, such as IgG, IgM, IgD, IgE, 
and IgA, from any mammalian species. 

Human antibodies can, but will infrequently, be 
drawn directly from human donors or human cells. In such 
case, antibodies to the proteins of the present invention 

35 will typically have resulted from fortuitous 
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immunization, such as autoimmune immunization, with the 
protein or protein fragments of the present invention. 
Such antibodies will typically, but will not invariably, 
be polyclonal. 

5 Human antibodies are more frequently obtained 

using transgenic animals that express human 
immunoglobulin genes, which transgenic animals can be 
affirmatively immunized with the protein immunogen of the 
present invention. Human Ig- transgenic mice capable of 

10 producing human antibodies and methods of producing human 
antibodies therefrom upon specific immunization are 
described, inter alia, in U.S. Patent Nos. 6,162,963; 
6,150,584; 6,114,598; 6,075,181; 5,939,598; 5,877,397; 
5,874,299; 5,814,318; 5,789,650; 5,770,429; 5,661,016; 

15 5,633,425; 5,625,126; 5,569,825; 5,545,807; 5,545,806, 
and 5,591,669, the disclosures of which are incorporated 
herein by reference in their entireties. Such antibodies 
are typically monoclonal, and are typically produced 
using techniques developed for production of murine 

20 antibodies. 

Human antibodies are particularly useful, and 
often preferred, when the antibodies of the present 
invention are to be administered to human beings as in 
vivo diagnostic or therapeutic agents, since recipient 

25 immune response to the administered antibody will often 
be substantially less than that occasioned by 
administration of an antibody derived from another 
species, such as mouse. 

IgG, IgM, IgD, IgE and IgA antibodies of the 

30 present invention are also usefully obtained from other 
mammalian species, including rodents — typically mouse, 
but also rat, guinea pig, and hamster - lagomorphs, 
typically rabbits, and also larger mammals, such as 
sheep, goats, cows, and horses. In such cases, as with 

35 the transgenic human-antibody-producing non-human 
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mammals, fortuitous immunization is not required, and the 
non-human mammal is typically affirmatively immunized, 
according to standard immunization protocols, with the 
protein or protein fragment of the present invention. 
5 As discussed above, virtually all fragments of 

8 or more contiguous amino acids of the proteins of the 
present invention can be used effectively as immunogens 
when conjugated to a carrier, typically a protein such as 
bovine thyroglobulin, keyhole limpet hemocyanin, or 

10 bovine serum albumin, conveniently using a bifunctional 
linker such as those described elsewhere above, which 
discussion is incorporated by reference here. 

Immunogenicity can also be conferred by fusion 
of the proteins and protein fragments of the present 

15 invention to other moieties. 

For example, peptides of the present invention 
can be produced by solid phase synthesis on a branched 
polylysine core matrix; these multiple antigenic peptides 
(MAPs) provide high purity, increased avidity, accurate 

20 chemical definition and improved safety in vaccine 
development. Tarn et al . , Proc. Natl. Acad. Sci. USA 
85:5409-5413 (1988); Posnett et al . , J. Biol. Chem. 263, 
1719-1725 (1988) . 

Protocols for immunizing non-human mammals are 

25 well-established in the art, Harlow et al . (eds.), 
Antibodies: A Laboratory Manual , Cold Spring Harbor 
Laboratory (1998) (ISBN: 0879693142); Coligan et al. 
(eds.), Current Protocols in Immunology , John Wiley & 
Sons, Inc. (2001) (ISBN: 0-471-52276-7); Zola, Monoclonal 

30 Antibodies : Preparation and Use of Monoclonal Antibodies 
and Engineered Antibody Derivatives (Basics : From 
Background to Bench) , Springer Verlag (2000) (ISBN: 
0387915907) , the disclosures of which are incorporated 
herein by reference, and often include multiple 

35 immunizations, either with or without adjuvants such as 
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Freund's complete adjuvant and Freund's incomplete 
adjuvant . 

Antibodies from nonhuman mammals can be 
polyclonal or monoclonal, with polyclonal antibodies 
5 having certain advantages in immunohistochemical 

detection of the proteins of the present invention and 
monoclonal antibodies having advantages in identifying 
and distinguishing particular epitopes of the proteins of 
the present invention. 

10 Following immunization, the antibodies of the 

present invention can be produced using any art -accepted 
technique. Such techniques are well known in the art, 
Coligan et al . (eds.), Current Protocols in Immunology , 
John Wiley & Sons, Inc. (2001) (ISBN: 0-471-52276-7); 

15 Zola, Monoclonal Antibodies ; Preparation and Use of 

Monoclonal Antibodies and Engineered Antibody Derivatives 
(Basics: From Background to Bench) , Springer Verlag 
(2000) (ISBN: 0387915907); Howard et al . (eds.), Basic 
Methods in Antibody Production and Characterization , CRC 

20 Press (2000) (ISBN: 0849394457); Harlow et al . (eds.), 
Antibodies: A Laboratory Manual , Cold Spring Harbor 
Laboratory (1998) (ISBN: 0879693142); Davis (ed.), 
Monoclonal Antibody Protocols , Vol. 45, Humana Press 
(1995) (ISBN: 0896033082); Delves (ed.), Antibody 

25 Production: Essential Techniques , John Wiley & Son Ltd 

(1997) (ISBN: 0471970107); Kenney, Antibody Solution: An 
Antibody Methods Manual , Chapman & Hall (1997) (ISBN: 
0412141914) , incorporated herein by reference in their 
entireties, and thus need not be detailed here. 

30 Briefly, however, such techniques include, 

inter alia, production of monoclonal antibodies by 
hybridomas and expression of antibodies or fragments or 
derivatives thereof from host cells engineered to express 
immunoglobulin genes or fragments thereof. These two 

35 methods of production are not mutually exclusive: genes 
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encoding antibodies specific for the proteins or protein 
fragments of the present invention can be cloned from 
hybridomas and thereafter expressed in other host cells. 
Nor need the two necessarily be performed together: 
5 e.g., genes encoding antibodies specific for the proteins 
and protein fragments of the present invention can be 
cloned directly from B cells known to be specific for the 
desired protein, as further described in U.S. Pat. No. 
5,627,052, the disclosure of which is incorporated herein 
10 by reference in its entirety, or from antibody-displaying 
phage . 

Recombinant expression in host cells is 
particularly useful when fragments or derivatives of the 
antibodies of the present invention are desired. 
15 Host cells for recombinant antibody 

production - either whole antibodies, antibody fragments, 
or antibody derivatives - can be prokaryotic or 
eukaryotic . 

Prokaryotic hosts are particularly useful for 
20 producing phage displayed antibodies of the present 
invention. 

The technology of phage -displayed antibodies, 
in which antibody variable region fragments are fused, 
for example, to the gene III protein (pill) or gene VIII 

25 protein (pVIII) for display on the surface of filamentous 
phage, such as M13, is by now well-established, Sidhu, 
Curr. Opin. Biotechnol . ll(6):610-6 (2000); Griffiths et 
al., Curr. Opin. Biotechnol. 9(1): 102-8 (1998); 
Hoogenboom et al., Immunotechnology, 4(l):l-20 (1998); 

30 Rader et al . , Current Opinion in Biotechnology 8:503-508 
(1997); Aujame et al . , Human Antibodies 8:155-168 (1997); 
Hoogenboom, Trends in Biotechnol. 15:62-70 (1997); de 
Kruif et al . , 17:453-455 (1996); Barbas et al . , Trends in 
Biotechnol. 14:230-234 (1996); Winter et al . , Ann. Rev. 

35 Immunol. 433-455 (1994), and techniques and protocols 
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required to generate, propagate, screen (pan), and use 
the antibody fragments from such libraries have recently 
been compiled, Barbas et al . , Phage Display: A Laboratory 
Manual , Cold Spring Harbor Laboratory Press (2001) (ISBN 
5 0-87969-546-3); Kay et al . (eds.), Phage Display of 
Peptides and Proteins: A Laboratory Manual , Academic 
Press, Inc. (1996); Abelson et al . (eds.), Combinatorial 
Chemistry , Methods in Enzymology vol. 267, Academic Press 
(May 1996), the disclosures of which are incorporated 

10 herein by reference in their entireties. 

Typically, phage -displayed antibody fragments 
are scFv fragments or Fab fragments; when desired, full 
length antibodies can be produced by cloning the variable 
regions from the displaying phage into a complete 

15 antibody and expressing the full length antibody in a 
further prokaryotic or a eukaryotic host cell. 

Eukaryotic cells are also useful for expression 
of the antibodies, antibody fragments, and antibody 
derivatives of the present invention. 

20 For example, antibody fragments of the present 

invention can be produced in Pichia pas tor is , Takahashi 
et al., Biosci. Biotechnol. Biochem. 64 (10) : 2138-44 
(2000); Freyre et al., J. Biotechnol. 76 (2-3) : 157-63 
(2000); Fischer et al., Biotechnol. Appl . Biochem. 30 (Pt 

25 2):117-20 (1999); Pennell et al., Res. Immunol. 

149 (6) :599-603 (1998); Eldin et al . , J. Immunol. Methods. 
201(1): 67-75 (1997); and in Saccharomyces cerevisiae, 
Frenken et al., Res. Immunol. 149 (6) : 589-99 (1998); 
Shusta et al . , Nature Biotechnol. 16(8):773-7 (1998), the 

30 disclosures of which are incorporated herein by reference 
in their entireties. 

Antibodies, including antibody fragments and 
derivatives, of the present invention can also be 
produced in insect cells, Li et al . , Protein Expr. Pur if . 
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21(1): 121-8 (2001); Ailor et al.,- Biotechnol . Bioeng. 
58 (2-3) :196-203 (1998); Hsu et al . , Biotechnol. Prog. 
13{1):96-104 (1997); Edelman et al . , Immunology 
91(1): 13-9 (1997); and Nesbit et al . , J. Immunol. 
5 Methods. 151 (1-2) : 201-8 (1992), the disclosures of which 
are incorporated herein by reference in their entireties. 

Antibodies and fragments and derivatives 
thereof of the present invention can also be produced in 
plant cells, Giddings et al., Nature Biotechnol. 

10 18 (11) : 1151-5 (2000); Gavilondo et al . , Biotechniques 
29(l):128-38 (2000); "Fischer et al., J". Biol. Regul. 
Homeost. Agents 14(2):83-92 (2000); Fischer et al., 
Biotechnol. Appl . Biochem. 30 (Pt 2):113-6 (1999); 
Fischer et al . , Biol. Chem. 380 (7-8) : 825-39 (1999); 

15 Russell, Curr. Top. Microbiol. Immunol. 240:119-38 
(1999); and Ma et al., Plant Physiol. 109(2): 341-6 
(1995) , the disclosures of which are incorporated herein 
by reference in their entireties. 

Mammalian cells useful for recombinant 

20 expression of antibodies, antibody fragments, and 

antibody derivatives of the present invention include CHO 
cells, COS cells, 293 cells, and myeloma cells. 

Verma et al . , J. Immunol. Methods 
216 (1-2) :165-81 (1998), review and compare bacterial, 

25 yeast, insect and mammalian expression systems for 
expression of antibodies. 

Antibodies of the present invention can also be 
prepared by cell free translation, as further described 
in Merk et al., J. Biochem. (Tokyo)'. 125 (2) : 328-33 (1999) 

30 and Ryabova et al., Nature Biotechnol. 15(l):79-84 

(1997) , and in the milk of transgenic animals, as further 
described in Pollock et al., J. Immunol. Methods 
231 (1-2) : 147-57 (1999), the disclosures of which are 
incorporated herein by reference in their entireties. 
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The invention further provides antibody 
fragments that bind specifically to one or more of the 
proteins and protein fragments of the present invention, 
to one or more of the proteins and protein fragments 
5 encoded by the isolated nucleic acids of the present 

invention, or the binding of which can be competitively 
inhibited by one or more of the proteins and protein 
fragments of the present invention or one or more of the 
proteins and protein fragments encoded by the isolated 

10 nucleic acids of the present invention. 

Among such useful fragments are Fab, Fab 1 , Fv, 
F(ab) ! 2 / and single chain Fv (scFv) fragments. Other 
useful fragments are described in Hudson, Curr. Opin. 
Biotechnol. 9(4):395-402 (1998). 

15 It is also an aspect of the present invention 

to provide antibody derivatives that bind specifically to 
one or more of the proteins and protein fragments of the 
present invention, to one or more of the proteins and 
protein fragments encoded by the isolated nucleic acids 

20 of the present invention, or the binding of which can be 
competitively inhibited by one or more of the proteins 
and protein fragments of the present invention or one or 
more of the proteins and protein fragments encoded by the 
isolated nucleic acids of the present invention. 

25 Among such useful derivatives are chimeric, 

primatized, and humanized antibodies; such derivatives 
are less immunogenic in human beings, and thus more 
suitable for in vivo administration, than are unmodified 
antibodies from non-human mammalian species. 

30 Chimeric antibodies typically include heavy 

and/or light chain variable regions (including both CDR 
and framework residues) of immunoglobulins of one 
species, typically mouse, fused to constant regions of 
another species, typically human. See, e.g., U.S. Pat. 

35 No. 5,807,715; Morrison et al . , Proc. Natl. Acad. Sci 
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USA. 81 (21) :6851-5 (1984); Sharon et al . , Nature 
309 (5966) : 364-7 (1984); Takeda et al . , Nature 
314 (6010) :452-4 (1985), the disclosures of which are 
incorporated herein by reference in their entireties. 
5 Prirnatized and humanized antibodies typically include 
heavy and/or light chain CDRs from a murine antibody 
grafted into a non-human primate or human antibody V 
region framework, usually further comprising a human 
constant region, Riechmann et al., Nature 332 (6162) : 323-7 

10 (1988); Co et al . , Nature 351 (6326) : 501-2 (1991); U.S. 
Pat. Nos. 6,054,297; 5,821,337; 5,770,196; 5,766,886; 
5,821,123; 5,869,619; 6,180,377; 6,013,256; 5,693,761; 
and 6,180,370, the disclosures of which are incorporated 
herein by reference in their entireties. 

15 Other useful antibody derivatives of the 

invention include heteromeric antibody complexes and 
antibody fusions, such as diabodies (bispecific 
antibodies), single-chain diabodies, and intrabodies. 

The antibodies of the present invention, 

20 including fragments and derivatives thereof, can usefully 
be labeled. It is, therefore, another aspect of the 
present invention to provide labeled antibodies that bind 
specifically to one or more of the proteins and protein 
fragments of the present invention, to one or more of the 

25 proteins and protein fragments encoded by the isolated 

nucleic acids of the present invention, or the binding of 
which can be competitively inhibited by one or more of 
the proteins and protein fragments of the present 
invention or one or more of the proteins and protein 

30 fragments encoded by the isolated nucleic acids of the 
present invention. 

The choice of label depends, in part, upon the 
desired use. 

For example, when the antibodies of the present 
35 invention are used for immunohistochemical staining of 
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tissue samples, the label can usefully be an enzyme that 
catalyzes production and local deposition of a detectable 
product . 

Enzymes typically conjugated to antibodies to 
5 permit their immunohistochemical visualization are well 
known, and include alkaline phosphatase, p-galactosidase , 
glucose oxidase, horseradish peroxidase (HRP) , and 
urease. Typical substrates for production and deposition 
of visually detectable products include 

10 o-nitrophenyl-beta-D-galactopyranoside (ONPG) ; 

o-phenylenediamine dihydrochloride (OPD) ; p-nitrophenyl 
phosphate (PNPP) ; p-nitrophenyl -beta-D-galactopryanoside 
(PNPG) ; 3 ' , 3 1 -diaminobenzidine (DAB) ; 3 -amino- 9- 
ethylcarbazole (AEC) ; 4-chloro-l-naphthol (CN) ; 

15 5-bromo-4-chloro-3-indolyl-phosphate (BCIP) ; ABTS®; 
BluoGal; iodonitrotetrazolium (INT); nitroblue 
tetrazolium chloride (NBT) ; phenazine methosulfate (PMS) ; 
phenolphthalein monophosphate (PMP) ; tetramethyl 
benzidine (TMB) ; tetranitroblue tetrazolium (TNBT) ; 

20 X-Gal; X-Gluc; and X-Glucoside. 

Other substrates can be used to produce 
products for local deposition that are luminescent. For 
example, in the presence of hydrogen peroxide (H 2 0 2 ) , 
horseradish peroxidase (HRP) can catalyze the oxidation 

25 of cyclic diacylhydrazides, such as luminol . Immediately 
following the oxidation, the luminol is in an excited 
state (intermediate reaction product), which decays to 
the ground state by emitting light. Strong enhancement of 
the light emission is produced by enhancers, such as 

30 phenolic compounds. Advantages include high sensitivity, 
high resolution, and rapid detection without 
radioactivity and requiring only small amounts of 
antibody. See, e.g., Thorpe et al . , Methods Enzymol . 
133:331-53 (1986); Kricka et al . , J". Immunoassay 

35 17<l):67-83 (1996); and Lundqvist et al . , J". Biolumin. 
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Chemilumin. 10(6):353-9 (1995), the disclosures of which 
are incorporated herein by reference in their entireties. 

Kits for such enhanced chemiluminescent detection (ECL) 
are available commercially. 
5 The antibodies can also be labeled using 

colloidal gold. 

As another example, when the antibodies of the 
present invention are used, e.g., for flow cytometric 
detection, for scanning laser cytometric detection, or 

10 for fluorescent immunoassay, they can usefully be labeled 
with f luorophores . 

There are a wide variety of fluorophore labels 
that can usefully be attached to the antibodies of the 
present invention . 

15 For flow cytometric applications, both for 

extracellular detection and for intracellular detection, 
common useful fluorophores can be fluorescein 
isothiocyanate (FITC) , allophycocyanin (APC) , R- 
phycoerythrin (PE) , peridinin chlorophyll protein 

20 (PerCP) , Texas Red, Cy3, Cy5, fluorescence resonance 

energy tandem fluorophores such as PerCP-Cy5.5, PE-Cy5, 
PE-Cy5.5, PE-Cy7, PE- Texas Red, and APOCy7. 

Other fluorophores include, inter alia, Alexa 
Fluor® 350, Alexa Fluor® 488, Alexa Fluor® 532, Alexa 

25 Fluor® 546, Alexa Fluor® 568, Alexa Fluor® 594, Alexa 
Fluor® 647 (monoclonal antibody labeling kits available 
from Molecular Probes, Inc., Eugene, OR, USA), BODIPY 
dyes, such as BODIPY 493/503, BODIPY FL, BODIPY R6G, 
BODIPY 530/550, BODIPY TMR, BODIPY 558/568, BODIPY 

30 558/568, BODIPY 564/570, BODIPY 576/589, BODIPY 581/591, 
BODIPY TR, BODIPY 630/650, BODIPY 650/665, Cascade Blue, 
Cascade Yellow, Dansyl, lissamine rhodamine B., Marina 
Blue, Oregon Green 488, Oregon Green 514, Pacific Blue, 
rhodamine 6G, rhodamine green, rhodamine red, 

35 tetramethylrhodamine, Texas Red (available from Molecular 
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Probes, Inc., Eugene, OR, USA), and Cy2 , Cy3 , Cy3.5, Cy5, 

Cy5.5, Cy7, all of which are also useful for 

f luorescently labeling the antibodies of the present 

invention. 

5 For secondary detection using labeled avidin, 

streptavidin, capt avidin or neutravidin, the antibodies 
of the present invention can usefully be labeled with 
biotin. 

When the antibodies of the present invention 
10 are used, e.g., for western blotting applications, they 

can usefully be labeled with radioisotopes, such as 33 P, 

32 P, 35 S, 3 H, and 125 I. 

As another example, when the antibodies of the 

present invention are used for radioimmunotherapy, the 
15 label can usefully be 228 Th, 227 Ac, 225 Ac, 223 Ra, 213 Bi, 2x2 Pb, 

212 Bi, 211 At, 203 Pb, 194 0s, 188 Re, 186 Re, 153 Sm, 149 Tb, 131 I, 125 I, 

X11 ln, 105 Rh, 99ra Tc, 97 Ru, 90 Y, 90 Sr, 88 Y, 72 Se, 67 Cu, or 47 Sc. 

As another example, when the antibodies of the 

present invention are to be used for in vivo diagnostic 
20 use, they can be rendered detectable by conjugation to 

MRI contrast agents, such as gadolinium 

diethylenetriaminepentaacetic acid (DTPA) , Lauf fer et 

al., Radiology 207 (2) : 529-38 (1998), or by radioisotopic 

labeling 

25 As would be understood, use of the labels 

described above is not restricted to the application as 
for which they were mentioned. 

The antibodies of the present invention, 
including fragments and derivatives thereof, can also be 

30 conjugated to toxins, in order to target the toxin's 

ablative action to cells that display and/or express the 
proteins of the present invention. Commonly, the 
antibody in such immunotoxins is conjugated to 
Pseudomonas exotoxin A, diphtheria toxin, shiga toxin A, 

35 anthrax toxin lethal factor, or ricin. See Hall (ed.), 
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Immunotoxin Methods and Protocols (Methods in Molecular 
Biology, Vol 166), Humana Press (2000) (ISBN: 0896037754) ; 
and Frankel et al . (eds.), Clinical Applications of 
Immunotoxins , Springer-Verlag New York, Incorporated 
5 (1998) (ISBN:3540640975) , the disclosures of which are 
incorporated herein by reference in their entireties, for 
review. 

The antibodies of the present invention can 
usefully be attached to a substrate, and it is, 

10 therefore, another aspect of the invention to provide 
antibodies that bind specifically to one or more of the 
proteins and protein fragments of the present invention, 
to one or more of the proteins and protein fragments 
encoded by the isolated nucleic acids of the present 

15 invention, or the binding of which can be competitively 
inhibited by one or more of the proteins and protein 
fragments of the present invention or one or more of the 
proteins and protein fragments encoded by the isolated 
nucleic acids of the present invention, attached to a 

20 substrate. 

Substrates can be porous or nonporous, planar 
or nonplanar. 

For example, the antibodies of the present 
invention can usefully be conjugated to filtration media, 
25 such as NHS-activated Sepharose or CNBr-activated 

Sepharose for purposes of immunoaf f inity chromatography. 

For example, the antibodies of the present 
invention can usefully be attached to paramagnetic 
microspheres, typically by biotin-streptavidin 
30 interaction, which microsphere can then be used for 

isolation of cells that express or display the proteins 
of the present invention. As another example, the 
antibodies of the present invention can usefully be 
attached to the surface of a microtiter plate for ELISA. 
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As noted above, the antibodies of the present 
invention can be produced in prokaryotic and eukaryotic 
cells. It is, therefore, another aspect of the present 
invention to provide cells that express the antibodies of 
5 the present invention, including hybridoma cells, B 
cells, plasma cells, and host cells recombinant ly 
modified to express the antibodies of the present 
invention. 

In yet a further aspect, the present invention 
10 provides aptamers evolved to bind specifically to one or 
more of the proteins and protein fragments of the present 
invention, to one or more of the proteins and protein 
fragments encoded by the isolated nucleic acids of the 
present invention, or the binding of which can be 
15 competitively inhibited by one or more of the proteins 
and protein fragments of the present invention or one or 
more of the proteins and protein fragments encoded by the 
isolated nucleic acids of the present invention. 

20 Human AMLP1 Antibodies 

In a first series of antibody embodiments, the 
invention provides antibodies, both polyclonal and 
monoclonal, and fragments and derivatives thereof, that 

25 bind specifically to a polypeptide having an amino acid 
sequence encoded by the cDNA, or that have the amino acid 
sequence in SEQ ID NO: 3, which are full length human 
AMLPla proteins. 

In another series of antibody embodiments, the 

30 invention provides antibodies, both polyclonal and 

monoclonal, and fragments and derivatives thereof, that 
bind specifically to a polypeptide having an amino acid 
sequence encoded by the cDNA, or that have the amino acid 
sequence in SEQ ID NO: 6, which are full length human 

35 AMLPlb proteins. 
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Such antibodies are useful in in vitro 
immunoassays, such as ELISA, western blot or 
immunohistochemical assay of disease tissue or cells. 
Such antibodies are also useful in isolating and 

5 purifying human AMLP1 proteins, including related cross- 
reactive proteins, by immunoprecipitation, immunoaf f inity 
chromatography, or magnetic bead-mediated purification. 

In another series of antibody embodiments, the 
invention provides antibodies, both polyclonal and 

10 monoclonal, and fragments and derivatives thereof, the 
specific binding of which can be competitively inhibited 
by the isolated proteins and polypeptides of the present 
invention. 

In other embodiments, the invention further 
15 provides the above -described antibodies detectably 
labeled, and in yet other embodiments, provides the 
above -described antibodies attached to a substrate. 

PHARMACEUTICAL COMPOSITIONS 

20 

Human AMLP1 plays an important role within the 
cell as an adaptor protein that interacts with both 
angiostatin-like protein and components of the actin 
cytoskeleton, and has anti-angiogenesis activity. 

25 Defects in human AMLP1 expression, activity, 

distribution, localization, and/or solubility are a cause 
of human disease, which disease can manifest as a 
disorder of brain, liver, kidney, adrenal gland, 
prostate, testis, lung, placenta, skeletal muscle, heart 

30 or bone marrow function. 

Accordingly, pharmaceutical compositions 
comprising nucleic acids, proteins, and antibodies of the 
present invention, as well as mimetics, agonists, 
antagonists, or inhibitors of AMLP1 activity, can be 
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administered as therapeutics for treatment of AMLP1 
defects. 

Thus, in another aspect, the invention provides 
pharmaceutical compositions comprising the nucleic acids, 
5 nucleic acid fragments, proteins, protein fusions, 
protein fragments, antibodies, antibody derivatives, 
antibody fragments, mimetics, agonists, antagonists, and 
inhibitors of the present invention. 

Such a composition typically contains from 

10 about 0.1 to 90% by weight of a therapeutic agent of the 
invention formulated in and/or with a pharmaceutically 
acceptable carrier or excipient. 

Pharmaceutical formulation is a well- 
established art, and is further described in Gennaro 

15 (ed.), Remington: The Science and Practice of Pharmacy , 
20 th ed., Lippincott, Williams & Wilkins (2000) (ISBN: 
0683306472); Ansel et al . , Pharmaceutical Dosage Forms 
and Drug Delivery Systems , 7 th ed., Lippincott Williams & 
Wilkins Publishers (1999) (ISBN: 0683305727); and Kibbe 

20 (ed.), Handbook of Pharmaceutical Excipients American 
Pharmaceutical Association, 3 rd ed. (2000) (ISBN: 
091733096X) , the disclosures of which are incorporated 
herein by reference in their entireties, and thus need 
not be described in detail herein. 

25 Briefly, however, formulation of the 

pharmaceutical compositions of the present invention will 
depend upon the route chosen for administration. The 
pharmaceutical compositions utilized in this invention 
can be administered by various routes including both 

30 enteral and parenteral routes, including oral, 

intravenous, intramuscular, subcutaneous, inhalation, 
topical, sublingual, rectal, intra-arterial, 
intramedullary, intrathecal , intraventricular, 
transmucosal , transdermal , intranasal , intraperitoneal , 

35 intrapulmonary, and intrauterine. 
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Oral dosage forms can be formulated as tablets, 



pills, dragees, capsules, liquids, gels, syrups, 
slurries, suspensions, and the like, for ingestion by the 
patient . 



administration can contain suitable carriers or 
excipients, such as carbohydrate or protein fillers, such 
as sugars, including lactose, sucrose, mannitol, or 
sorbitol; starch from corn, wheat, rice, potato, or other 
10 plants; cellulose, such as methyl cellulose, 
hydroxypropylmethyl -cellulose, sodium 

carboxymethylcellulose, or microcrystalline cellulose; 

gums including arabic and tragacanth; proteins such as 

gelatin and collagen; inorganics, such as kaolin, calcium 
15 carbonate, dicalcium phosphate, sodium chloride; and 

other agents such as acacia and alginic acid. 

Agents that facilitate disintegration and/or 

solubilization can be added, such as the cross-linked 

polyvinyl pyrrolidone, agar, alginic acid, or a salt 
20 thereof, such as sodium alginate, microcrystalline 

cellulose, corn starch, sodium starch glycolate, and 

alginic acid. 

Tablet binders that can be used include acacia, 

methylcellulose, sodium carboxymethylcellulose, 
25 polyvinylpyrrolidone (Povidone™) , hydroxypropyl 

methylcellulose, sucrose, starch and ethylcellulose . 

Lubricants that can be used include magnesium 

stearates, stearic acid, silicone fluid, talc, waxes, 

oils, and colloidal silica. 
30 Fillers, agents that facilitate disintegration 

and/or solubilization, tablet binders and lubricants, 

including the aforementioned, can be used singly or in 

combination. 



5 



Solid formulations of the compositions for oral 



Solid oral dosage forms need not be uniform 



35 



throughout . 
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For example, dragee cores can be used in 
conjunction with suitable coatings, such as concentrated 
sugar solutions, which can also contain gum arabic, talc, 
polyvinylpyrrolidone, carbopol gel, polyethylene glycol, 
5 and/or titanium dioxide, lacquer solutions, and suitable 
organic solvents or solvent mixtures. 

Oral dosage forms of the present invention 
include push- fit capsules made of gelatin, as well as 
soft, sealed capsules made of gelatin and a coating, such 

10 as glycerol or sorbitol. Push-fit capsules can contain 
active ingredients mixed with a filler or binders, such 
as lactose or starches, lubricants, such as talc or 
magnesium stearate, and, optionally, stabilizers. In 
soft capsules, the active compounds can be dissolved or 

15 suspended in suitable liquids, such as fatty oils, 

liquid, or liquid polyethylene glycol with or without 
stabilizers . 

Additionally, dyestuffs or pigments can be 
added to the tablets or dragee coatings for product 

20 identification or to characterize the quantity of active 
compound , i.e., dosage . 

Liquid formulations of the pharmaceutical 
compositions for oral (enteral) administration are 
prepared in water or other aqueous vehicles and can 

25 contain various suspending agents such as 

methylcellulose, alginates, tragacanth, pectin, kelgin, 
carrageenan, acacia, polyvinylpyrrolidone, and polyvinyl 
alcohol. The liquid formulations can also include 
solutions, emulsions, syrups and elixirs containing, 

30 together with the active compound (s), wetting agents, 
sweeteners, and coloring and flavoring agents. 

The pharmaceutical compositions of the present 
invention can also be formulated for parenteral 
administration. 
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For intravenous injection, water soluble 
versions of the compounds of the present invention are 
formulated in, or if provided as a lyophilate, mixed 
with, a physiologically acceptable fluid vehicle, such as 

5 5% dextrose ( ,! D5 M ), physiologically buffered saline, 0.9% 
saline, Hanks 1 solution, or Ringer's solution. 

Intramuscular preparations, e.g. a sterile 
formulation of a suitable soluble salt form of the 
compounds of the present invention, can be dissolved and 

10 administered in a pharmaceutical excipient such as Water- 
for-Injection, 0.9% saline, or 5% glucose solution. 
Alternatively, a suitable insoluble form of the compound 
can be prepared and administered as a suspension in an 
aqueous base or a pharmaceutical^ acceptable oil base, 

15 such as an ester of a long chain fatty acid (e.g., ethyl 
oleate) , fatty oils such as sesame oil, triglycerides, or 
liposomes . 

Parenteral formulations of the compositions can 
contain various carriers such as vegetable oils, 

20 dimethylacetamide, dimethyl formamide, ethyl lactate, 

ethyl carbonate, isopropyl myristate, ethanol, polyols 
(glycerol, propylene glycol, liquid polyethylene glycol, 
and the like) . 

Aqueous injection suspensions can also contain 

25 substances that increase the viscosity of the suspension, 
such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Non-lipid polycationic amino polymers can also 
be used for delivery. Optionally, the suspension can 
also contain suitable stabilizers or agents that increase 

30 the solubility of the compounds to allow for the 
preparation of highly concentrated solutions. 

Pharmaceutical compositions of the present 
invention can also be formulated to permit injectable, 
long-term, deposition. 
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The pharmaceutical compositions of the present 
invention can be administered topically. 

A topical semi-solid ointment formulation 
typically contains a concentration of the active 
5 ingredient from about 1 to 20%, e.g., 5 to 10%, in a 
carrier such as a pharmaceutical cream base. Various 
formulations for topical use include drops, tinctures, 
lotions, creams, solutions, and ointments containing the 
active ingredient and various supports and vehicles. In 
10 other transdermal formulations, typically in patch- 
delivered formulations, the pharmaceutically active 
compound is formulated with one or more skin penetrants, 
such as 2 -N-methyl -pyrrol idone (NMP) or Azone. 

Inhalation formulations can also readily be 
15 formulated. For inhalation, various powder and liquid 
formulations can be prepared. 

The pharmaceutically active compound in the 
pharmaceutical compositions of the present inention can 
be provided as the salt of a variety of acids, including 
20 but not limited to hydrochloric, sulfuric, acetic, 

lactic, tartaric, malic, and succinic acid. Salts tend 
to be more soluble in aqueous or other protonic solvents 
than are the corresponding free base forms . 

After pharmaceutical compositions have been 
25 prepared, they are packaged in an appropriate container 
and labeled for treatment of an indicated condition. 

The active compound will be present in an 
amount effective to achieve the intended purpose. The 
determination of an effective dose is well within the 
30 capability of those skilled in the art. 

A "therapeutically effective dose" refers to 
that amount of active ingredient - for example AMLP1 
protein, fusion protein, or fragments thereof, antibodies 
specific for AMLP1 , agonists, antagonists or inhibitors 
35 of AMLP1 - which ameliorates the signs or symptoms of the 
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disease or prevents progression thereof; as would be 
understood in the medical arts, cure, although desired, 
is not required. 

The therapeutically effective dose of the 

5 pharmaceutical agents of the present invention can be 
estimated initially by in vitro tests, such as cell 
culture assays, followed by assay in model animals, 
usually mice, rats, rabbits, dogs, or pigs. The animal 
model can also be used to determine an initial useful 

10 concentration range and route of administration. 

For example, the ED50 (the dose therapeutically 
effective in 50% of the population) and LD50 (the dose 
lethal to 50% of the population) can be determined in one 
or more cell culture of animal model systems. The dose 

15 ratio of toxic to therapeutic effects is the therapeutic 
index, which can be expressed as LD50/ED50. 
Pharmaceutical compositions that exhibit large 
therapeutic indices are particularly useful. 

The data obtained from cell culture assays and 

20 animal studies is used in formulating an initial dosage 
range for human use, and preferably provides a range of 
circulating concentrations that includes the ED50 with 
little or no toxicity. After administration, or between 
successive administrations, the circulating concentration 

25 of active agent varies within this range depending upon 
pharmacokinetic factors well known in the art, such as 
the dosage form employed, sensitivity of the patient, and 
the route of administration. 

The exact dosage will be determined by the 

30 practitioner, in light of factors specific to the subject 
requiring treatment. Factors that can be taken into 
account by the practitioner include the severity of the 
disease state, general health of the subject, age, 
weight, gender of the subject, diet, time and frequency 

35 of administration, drug combination (s) , reaction 
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sensitivities, and tolerance/response to therapy. 
Long-acting pharmaceutical compositions can be 
administered every 3 to 4 days, every week, or once every 
two weeks depending on half-life and clearance rate of 

5 the particular formulation. 

Normal dosage amounts may vary from 0.1 to 
100,000 micrograms, up to a total dose of about 1 g, 
depending upon the route of administration. Where the 
therapeutic agent is a protein or antibody of the present 

10 invention, the therapeutic protein or antibody agent 

typically is administered at a daily dosage of 0.01 mg to 
30 mg/kg of body weight of the patient (e.g., lmg/kg to 5 
mg/kg) . The pharmaceutical formulation can be 
administered in multiple doses per day, if desired, to 

15 achieve the total desired daily dose. 

Guidance as to particular dosages and methods 
of delivery is provided in the literature and generally 
available to practitioners in the art. Those skilled in 
the art will employ different formulations for 

20 nucleotides than for proteins or their inhibitors. 

Similarly, delivery of polynucleotides or polypeptides 
will be specific to particular cells, conditions, 
locations, etc. 

Conventional methods, known to those of 

25 ordinary skill in the art of medicine, can be used to 
administer the pharmaceutical formulation (s) of the 
present invention to the patient. The pharmaceutical 
compositions of the present invention can be administered 
alone, or in combination with other therapeutic agents or 

30 int ervent ions . 

THERAPEUTIC METHODS 

The present invention further provides methods 
35 of treating subjects having defects in AMLP1 - e.g., in 
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expression, activity, distribution, localization, and/or 
solubility of AMLP1 - which can manifest as a disorder of 
brain, liver, kidney, adrenal gland, prostate, testis, 
lung, placenta, skeletal muscle, heart or bone marrow 
5 function. As used herein, "treating" includes all 

medically-acceptable types of therapeutic intervention, 
including palliation and prophylaxis (prevention) of 
disease . 

In one embodiment of the therapeutic methods of 

10 the present invention, a therapeutically effective amount 
of a pharmaceutical composition comprising AMLP1 protein, 
fusion, fragment or derivative thereof is administered to 
a subject with a clinically-significant AMLP1 defect. 

Protein compositions are administered, for 

15 example, to complement a deficiency in native AMLP1. In 
other embodiments, protein compositions are administered 
as a vaccine to elicit a humoral and/or cellular immune 
response to AMLP1 . The immune response can be used to 
modulate activity of AMLP1 or, depending on the 

20 immunogen, to immunize against aberrant or aberrantly 
expressed forms, such as mutant or inappropriately 
expressed isoforms. In yet other embodiments, protein 
fusions having a toxic moiety are administered to ablate 
cells that aberrantly accumulate AMLP1 . 

25 In another embodiment of the therapeutic 

methods of the present invention, a therapeutically 
effective amount of a pharmaceutical composition 
comprising nucleic acid of the present invention is 
administered. The nucleic acid can be delivered in a 

30 vector that drives expression of AMLP1 protein, fusion, 
or fragment thereof, or without such vector. 

Nucleic acid compositions that can drive 
expression of AMLP1 are administered, for example, to 
complement a deficiency in native AMLP1, or as DNA 

35 vaccines. Expression vectors derived from virus, 
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replication deficient retroviruses, adenovirus, adeno- 
associated (AAV) virus, herpes virus, or vaccinia virus 
can be used - see, e.g., Cid-Arregui (ed.), Viral 
Vectors: Basic Science and Gene Therapy , Eaton Publishing 
5 Co., 2000 (ISBN: 188129935X) - as can plasmids. 

Antisense nucleic acid compositions, or vectors 
that drive expression of AMLPl antisense nucleic acids, 
are administered to downregulate transcription and/or 
translation of AMLPl in circumstances in which excessive 

10 production, or production of aberrant protein, is the 
pathophysiologic basis of disease. 

Antisense compositions useful in therapy can 
have sequence that is complementary to coding or to 
noncoding regions of the AMLPl gene. For example, 

15 oligonucleotides derived from the transcription 

initiation site, e.g., between positions -10 and +10 from 
the start site, are particularly useful. 

Catalytic antisense compositions, such as 
ribozymes, that are capable of sequence-specific 

20 hybridization to AMLPl transcripts, are also useful in 
therapy. See, e.g., Phylactou, Adv. Drug Del iv. Rev. 
44 (2-3) : 97-108 (2000); Phylactou et al . , Hum. Mol . Genet. 
7 (10) :1649-53 (1998); Rossi, Ciba Found. Symp. 
209:195-204 (1997); and Sigurdsson et al . , Trends 

25 Biotechnol. 13(8):286-9 (1995), the disclosures of which 
are incorporated herein by reference in their entireties. 

Other nucleic acids useful in the therapeutic 
methods of the present invention are those that are 
capable of triplex helix formation in or near the AMLPl 

30 genomic locus. Such triplexing oligonucleotides, are able 
to inhibit transcription, Intody et al . , Nucleic Acids 
Res. 28 (21) :4283-90 (2000); McGuffie et al . , Cancer Res. 
60(14) :3790-9 (2000), the disclosures of which are 
incorporated herein by reference, and pharmaceutical 

35 compositions comprising such triplex forming oligos 
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(TFOs) are administered in circumstances in which 
excessive production, or production of aberrant protein, 
is a pathophysiologic basis of disease. 

In another embodiment of the therapeutic 

5 methods of the present invention, a therapeutically 
effective amount of a pharmaceutical composition 
comprising an antibody (including fragment or derivative 
thereof) of the present invention is administered. As is 
well known, antibody compositions are administered, for 

10 example, to antagonize activity of AMLP1, or to target 
therapeutic agents to sites of AMLP1 presence and/or 
accumulat ion . 

In another embodiment of the therapeutic 
methods of the present invention, a pharmaceutical 

15 composition comprising a non-antibody antagonist of AMLP1 
is administered. Antagonists of AMLP1 can be produced 
using methods generally known in the art. In particular, 
purified AMLP1 can be used to screen libraries of 
pharmaceutical agents, often combinatorial libraries of 

20 small molecules, to identify those that specifically bind 
and antagonize at least one activity of AMLP1 . 

In other embodiments a pharmaceutical 
composition comprising an agonist of AMLP1 is 
administered. Agonists can be identified using methods 

25 analogous to those used to identify antagonists. 

In still other therapeutic methods of the 
present invention, pharmaceutical compositions comprising 
host cells that express AMLP1 , fusions, or fragments 
thereof can be administered. In such cases, the cells 

30 are typically autologous, so as to circumvent xenogeneic 
or allotypic rejection, and are administered to 
complement defects in AMLP1 production or activity. 

In other embodiments, pharmaceutical 
compositions comprising the AMLPl proteins, nucleic 

35 acids, antibodies, antagonists, and agonists of the 
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present invention can be administered in combination with 
other appropriate therapeutic agents. Selection of the 
appropriate agents for use in combination therapy can be 
made by one of ordinary skill in the art according to 

5 conventional pharmaceutical principles. The combination 
of therapeutic agents or approaches can act additively or 
synergistically to effect the treatment or prevention of 
the various disorders described above, providing greater 
therapeutic efficacy and/or permitting use of the 

10 pharmaceutical compositions of the present invention 

using lower dosages, reducing the potential for adverse 
side effects. 

TRANSGENIC ANIMALS AND CELLS 

15 

In another aspect, the invention provides 
transgenic cells and non-human organisms comprising AMLP1 
isoform nucleic acids, and transgenic cells and non-human 
organisms with targeted disruption of the endogenous 
20 orthologue of the AMLP1 gene. 

The cells can be embryonic stem cells or 
somatic cells. The transgenic non-human organisms can be 
chimeric, nonchimeric heterozygotes, and nonchimeric 
homozygotes . 

25 

DIAGNOSTIC METHODS 

The nucleic acids of the present invention can 
be used as nucleic acid probes to assess the levels of 
30 AMLPl mRNA in disease tissue or cells, and antibodies of 
the present invention can be used to assess the 
expression levels of AMLPl proteins in disease tissue or 
cells to diagnose cancer. 
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The following examples are offered for purpose 
of illustration, not limitation. 



EXAMPLE 1 

5 Identification and Characterization of 

cDNAs Encoding Human AMLP1 Proteins 

Bioinf ormatic algorithms were applied to human 
genomic sequence data to identify putative exons. Based 

10 on sequence information of one such exon, we identified a 
possible open reading frame. The predicted protein 
sequence from this potential ORF shares significant 
homology with GTPase activating proteins. 

Marathon- Ready™ adult testis cDNA (Clontech 

15 Laboratories, Palo Alto, CA, USA) was used as a substrate 
for standard RT-PCR to obtain cDNA clones that correspond 
to alternatively spliced forms of AMLP1 along with the 
forward primer ( 5 1 -AGCCCCAGTAGTCCTGTCCAGGTTCT-3' ; SEQ ID 
NO: 862) and the reverse primer (5 1 - 

20 TAGATGAGGACTTCCATCATCTCTCC-3' ; SEQ ID NO: 863). In 
addition, the same cDNA was used as a substrate for 
standard RACE (rapid amplification of cDNA ends) to 
obtain cDNA clones that correspond to the 3 1 end of the 
AMLP1 gene. Alternatively spliced forms of the gene span 

25 3.1 and 3.2 kilobases and appear to contain the entire 
coding region of the gene to which the exon contributes; 
for reasons described below, we termed these cDNAs AMLPla 
and AMLPlb, respectively. 

The AMLP1 isoform cDNAs were sequenced on both 

30 strands using a MegaBACE™ 1000 sequencer (Amersham 
Biosciences, Sunnyvale, CA, USA) . Sequencing both 
strands provided us with the exact chemical structure of 
the cDNAs, which are shown in FIG. 3 and FIG. 4 and 
further presented in the SEQUENCE LISTING as SEQ ID NO: 1 

35 and SEQ ID NO: 4, and placed us in actual physical 

possession of the entire set of single -base incremented 
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fragments of the sequenced clone, starting at the 5* and 
3* termini. 

Two overlapping cDNA products were cloned that 
together contained the complete sequence of AMLPla. 

5 Another two overlapping cDNA products were cloned that 

together contained the complete sequence of AMLPlb. These 
experiments placed us in possession of a complete set of 
fragments of the template. 

As shown in FIG. 3, the human AMLPla cDNA spans 

10 3173 nucleotides and contains an open reading frame from 
nucleotide 190 through and including nt 2799 (inclusive 
of termination codon) , predicting a protein of 869 amino 
acids with a (posttranslationally unmodified) molecular 
weight of 96.8 kD. The clone appears full length, with 

15 the reading frame opening starting with a methionine and 
terminating with a stop codon. 

As shown in FIG. 4, the human AMLPlb cDNA spans 
3248 nucleotides and contains an open reading frame from 
nucleotide 190 through and including nt 1878 (inclusive 

20 of termination codon) , predicting a protein of 562 amino 
acids with a (posttranslationally unmodified) molecular 
weight of 63.4 kD. The clone appears full length, with 
the reading frame opening starting with a methionine and 
terminating with a stop codon. 

25 BLAST query of genomic sequence identified one 

BACS, spanning 76 kb, that constitute the minimum set of 
clones encompassing the cDNA sequence. Based upon the 
known origin of the BACs (GenBank accession numbers 
AP001152.4, the human AMLP1 gene can be mapped to human 

30 chromosome llq21. 

Comparison of the cDNA and genomic sequences 
identified 12 exons for AMLPla, and an additional exon 
(exon 8) for AMLPlb. Exon organizations of AMLPla and 
AMLPlb are listed in Table 1 and Table 2, respectively. 
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Exon no. 


cDNA range 


genomic range 


BAC accession 


1 


1-130 


7552-7681 


AP001152.4 


2 


131-1049 


11911-12832 




3 


1050-1341 


34051-34342 




4 


1342-1486 


42571-42715 




5 


1487-1576 


43973-44062 




6 


1577-1722 


62634-62779 




7 


1723-1872 


66453-66602 




8 


1873-2063 


72045-72235 




9 


2064-2189 


77255-77380 




10 


2190-2416 


78462-78688 




11 


2417-2692 


81718-81993 




12 


2693-3141 


83209-83657 










Exon no. 


cDNA range 


genomic range 


BAC accession 


1 


1-130 


7552-7681 


AP001152.4 


2 


131-1052 


11911-12832 




3 


1053-1344 


34051-34342 




4 


1345-1489 


42571-42715 




5 


1490-1579 


43973-44062 




6 


1580-1725 


62634-62779 




7 


1726-1875 


66453-66602 




8 


1876-1941 


70184-70249 
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9 


1942-2132 


72045-72235 




10 


2133-2258 


77255-77380 




11 


2259-2485 


78462-78688 




12 


2486-2761 


81718-81993 




13 


2762-3214 


83209-83661 





Sequence comparison of AMLPla, AMLPlb cDNA as 
well as the AP001152.4 genomic clone revealed ten allelic 

5 variations between the three. The frequency of allelic 

variations for the AMLP1 gene observed here is similar to 
those reported in earlier studies that surveyed rates of 
SNP occurrence in a variety of genes. Cargill et al . , 
Nature Genet. 22:231-238 (1999). Most of the variations 

10 we observed are single nucleotide polymorphisms (SNPs) , 
with only one being a three base pair insertion in the 
AMLPlb cDNA. This insertion occurred within a 
trinucleotide (CAG) repeat, of which AMLPla contains 
seven copies. Table 3 lists the position, nucleotide 

15 sequence, as well as the amino acid sequence for the ten 
allelic variations. Analysis of such allelic variantions 
in a population can be used for association studies to 
establish gene -disease relationship. 



-i^^^^^^&tionB among. AMLPJa^ AMLPlb and AP001152.4. 


AMPLla 
Position 


AMPLlb 
Position 


AMPLla 
Allele 


AMLPla 
amino 
acid 


AMPLlb 
Allele 


AMLPlb 
amino 
acid 


AP001152 


222 


222 


G 


T 


A 


T 


A 


339 


339 


T 


N 


c 


N 


c 
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rVjamong AMLEla" AMLFlb and AP001152.4, 


AMPLla 
Position 


AMPLlb 
Position 


AMPLla 
Allele 


AMLPla 
amino 
acid 


AMPLlb 
Allele 


AMLPlb 
amino 
acid 


AP001152 


550 


550-552 






CAG 


Q 




700 


703 


A 


I 


G 


V 


G 


713 


716 


G 


S 


C 


T 


G \ 


788 


791 


A 


D 


G 


G 


G 


811 


814 


C 


P 


T 


S 


T 


1541 


1544 


G 


G 


A 


E 


A 


1603 


1606 


A 


K 


G 


E 


A 


2126 


2195 


A 


H 


G 


Non- 
coding 


A 



FIG. 2 schematizes the exon organization of the 
human AMLP1 clones. 
5 At the top is shown the bacterial artificial 

chromosome (BAC) , with GenBank accession numbers, that 
spans the human AMLP1 locus. 

As shown in FIG. 2, AMLPla encodes a longer 
open reading frames compared to AMLPlb, and a protein of 
10 869 amino acids. AMLPla is comprised of exons 1-12. 
Insertion of a 66 base pair exon in AMLPlb (exon 8 of 
AMLPlb) leads to frame shift and a shortened ORF with a 
protein of 563 amino acids. The predicted molecular 
weights for AMLPla and AMLPlb, prior to any post- 
15 translational modifications, are 96.8 and 63.4 kD, 
respectively. 

As further discussed in the examples herein, 
expression of AMLP1 was assessed using RT-PCR. RT-PCR 
detected high level expression of AMLP1 in brain, liver, 
20 kidney, and adrenal gland. AMLP1 expression is also 

detected in the other tissues tested, notably prostate, 
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testis, lung, placenta, skeletal muscle, heart, bone 
marrow as well as colon tumor. 

The sequences of the human AMLP1 cDNAs were 
used as a BLAST query into the GenBank nr and dbEst 

5 databases. The nr database includes all non-redundant 
GenBank coding sequence translations, sequences derived 
from the 3 -dimensional structures in the Brookhaven 
Protein Data Bank (PDB) , sequences from SwissProt, 
sequences from the protein information resource (PIR) , 

10 and sequences from protein research foundation (PRF) . 

The dbEst (database of expressed sequence tags) includes 
ESTs, short, single pass read cDNA (mRNA) sequences, and 
cDNA sequences from differential display experiments and 
RACE experiments. 

15 BLAST search identified multiple human ESTs, 

and four mouse ESTs as having sequence closely related to 
AMLP1 . 



Globally, the human AMLP1 proteins resembles 
20 the human angiomotin protein (GenBank accession: 

AAG01851.1, the AMLPla protein with 61 % amino acid 
identity and 78 % amino acid similarity over 471 amino 
acids) . AMLP1 also resembles a putative mouse transcript 
(GenBank accession: BAB30287.1, the AMLPla protein with 
25 88 % amino acid identity and 91 % amino acid similarity 
over the entire open reading frame) . AMLP1 aslo resembles 
a human putative transcript (GenBank accession: 
BAA76833.1, the AMLPla protein with 43 % amino acid 
identity and 56 % amino acid similarity over 807 amino 
30 acids) . 

Motif searches using Pfam 
(http://pfam.wustl.edu) , SMART (http://smart.embl- 
heidelberg.de), and PROSITE pattern and profile databases 
(http://www.expasy.ch/prosite), identified several known 
35 domains shared with human angiomotin. 
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FIG. 1 schematizes the protein domain structure 
of human AMLPla and AMLPlb, and the alignment of the 
myosin-tail motif of AMLPla with that of other proteins. 

As schematized in FIG. 1, the newly isolated 

5 gene products share certain protein domains and an 

overall structural organization with human angiomotin. 
The shared structural features strongly imply that human 
AMLP1 plays a role similar to that of human angiomotin as 
an adaptor protein that interacts with both angiostatin- 

10 like protein and components of the actin cytoskeleton and 
has anti-angiogenesis activity. 

Like human angiomotin, human AMLP1 contains a 
partial Myosin-tail domain. In AMLPla, the partial 
Myosin-tail motif ocurrs at amino acids 351-733 

15 (http://www.ncbi.nlm.gov/Structure/cdd/wrpsb.cgi). In the 
shorter AMLPlb protein, the partial Myosin-tail motif 
ends at amino acids sequence position 562 (which is the 
last amino acid for AMLPlb) . The Myosin-tail motif is 
represented by the coiled-coil myosin heavy chain tail 

20 region. The coiled-coil is composed of the tail from two 
molecules of myosin. These can then assemble into the 
macromolecular thick filament. The coiled-coil region 
provides the structural backbone of the thick filament. 

Other signatures of the newly isolated AMLP1 

25 proteins were identified by searching the PROSITE 

database (http://www.expasy.ch/tools/scnpsitl.html) . For 
AMLPla, these signatures include four N-glycosylation 
sites (51 - 54, 57 - 60, 631 - 634 and 635 - 638), one 
cAMP- and cGMP- dependent protein kinase phosphorylation 

30 site (405 - 408), twelve protein kinase C phosphorylation 
sites, seventeen Casein kinase II phosphorylation sites, 
six N-myristoylation sites (3 - 8, 194 - 199, 244 - 249, 
566 - 571, 743 - 748 and 784 - 789), as well as three 
tyrosine kinase phosphorylation sites (15 - 23, 453 - 459 

35 and 659 - 666) . For AMLPlb, these signatures include two 
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AT-glycosylation sites (51 - 54, 57 - 60), one cAMP- and 
cGMP-dependent protein kinase phosphorylation site (406 - 
409) , six protein kinase C phosphorylation sites, twelve 
Casein kinase II phosphorylation sites, three N- 
5 myristoylation sites (3 - 8, 195 - 200 and 245 - 250), as 
well as two tyrosine kinase phosphorylation sites (15 - 
23 and 454 - 460) . 



Possession of the genomic sequence permitted 
10 search for promoter and other control sequences for the 
human AMLP1 gene. 

A putative transcriptional control region, inclusive of 
promoter and downstream elements, was defined as 1 kb 
around the transcription start site, itself defined as 

15 the first nucleotide of the human AMLP1 cDNA clone. The 
region, drawn from sequence of BAC AP001152.4, has the 
sequence given in SEQ ID NO: 35, which lists 1000 
nucleotides before the transcription start site. 

Transcription factor binding sites were 

20 identified using a web based program 

(http://motif.genome.ad.jp/), including a binding site 
for TCF11 (789 - 801) , for Pbx-1 (846 - 854 bp) and for 
AP-1 (914 - 922, with numbering according to SEQ ID NO: 
35), amongst others. 

25 We have thus identified a newly described human 

gene, human AMLP1 (including two isoforms) , which shares 
certain protein domains and an overall structural 
organization with human angiomotin; the shared structural 
features strongly imply that the human AMLP1 protein 

30 plays a role similar to human angiomotin, as an adaptor 
protein that interacts with both angiostatin-like protein 
and components of the act in cytoskeleton and has anti- 
angiogenesis activity, making the human AMLP1 proteins 
and nucleic acids clinically useful diagnostic markers 

35 and potential therapeutic agents for cancer. 
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EXAMPLE 2 
Preparation and Labeling of 
Useful Fragments of Human AMLP1 

5 

Useful fragments of AMLP1 are produced by PCR, 
using standard techniques, or solid phase chemical 
synthesis using an automated nucleic acid synthesizer. 
Each fragment is sequenced, confirming the exact chemical 

10 structure thereof. 

The exact chemical structure of preferred 
fragments is provided in the attached SEQUENCE LISTING, 
the disclosure of which is incorporated herein by 
reference in its entirety. The following summary 

15 identifies the fragments whose structures are more fully 
described in the SEQUENCE LISTING: 



20 



25 



30 



35 



SEQ 


ID 


NO: 


1 


(nt, 


full length AMLPla cDNA) 


SEQ 


ID 


NO: 


2 


(nt, 


CDNA ORF of AMLPla) 


SEQ 


ID 


NO: 


3 


(aa, 


full length AMLPla protein) 


SEQ 


ID 


NO: 


4 


(nt, 


full length AMLPlb cDNA) 


SEQ 


ID 


NO: 


5 


(nt, 


CDNA ORF of AMLPlb) 


SEQ 


ID 


NO: 


6 


(aa, 


full length AMLPlb protein) 


SEQ 


ID 


NO: 


7 


(nt, 


(nt 409 - 795) portion of 












AMLPla) 


SEQ 


ID 


NO: 


8 


(aa, 


(aa 74 - 202) CDS entirely 












within SEQ ID NO: 7) 


SEQ 


ID 


NO: 


9 - 


• 20 


(nt, exons 1 - 12 of AMLPla 












(from genomic sequence) ) 


SEQ 


ID 


NO: 


21 


- 32 


(nt, 500 bp genomic amplicon 












centered about exons 1-12 












of AMLPla) 


SEQ 


ID 


NO: 


33 


(nt, 


novel exon of AMLPlb) 


SEQ 


ID 


NO: 


34 


(nt, 


500 bp genomic amplicon centered 












about novel exon of AMLPlb) 


SEQ 


ID 


NO: 


35 


(nt, 


1000 bp putative promoter) 
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SEQ ID NOs: 36 - 406 (nt, 17-mers scanning (nt 

409 - 795) portion of 
AMLPla) 

SEQ ID NOs: 407 - 769 (nt, 25-mers scanning (nt 
5 409 - 795) portion of 

AMLPla) 

SEQ ID NOs: 770 - 819 (nt, 17-mers scanning novel 

exon of AMLPlb) 
SEQ ID NOs: 820 - 8 61 (nt, 25-mers scanning novel 
10 exon of 

AMLPlb) 

SEQ ID NO: 862 (nt, forward primer for cloning of 

AMLP1 cDNA) 

SEQ ID NO: 863 (nt, reverse primer for cloning of 
15 AMLP1 cDNA) 

SEQ ID NO: 864 (nt, forward primer for expression 

analysis of AMLP1 by RT-PCR) 
SEQ ID NO: 865 (nt, reverse primer for expression 

analysis of AMLP1 by RT-PCR) 
20 SEQ ID NO: 866 (aa, consensus sequence of the 

Myosin-tail motif) 
SEQ ID NO: 867 (aa, sequence of the AMLPla Myosin- 
tail motif) 

SEQ ID NO: 868 (aa, sequence of the Myosin-tail 
25 motif of protein MHC A) 

SEQ ID NO: 869 (aa, sequence of the Myosin-tail 

motif of Myosin_HC) 
SEQ ID NO: 870 (aa, sequence of the Myosin-tail 

motif of protein MHC B) 



30 



Upon confirmation of the exact structure, each 
of the above -described nucleic acids of confirmed 
structure is recognized to be immediately useful as an 
AMLPl-specif ic probe. 
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For use as labeled nucleic acid probes, the 
above -described AMLP1 nucleic acids are separately- 
labeled by random priming. As is well known in the art 
of molecular biology, random priming places the 
5 investigator in possession of a near-complete set of 

labeled fragments of the template of varying length and 
varying starting nucleotide. 

The labeled probes are used to identify the 
AMLP1 gene on a Southern blot, and are used to measure 
10 expression of AMLP1 mRNA on a northern blot and by RT- 
PCR, using standard techniques. 



EXAMPLE 3 

15 RT-PCR Analysis of AMLP1 Expression 

The expression pattern of AMLP1 in human 
tissues was analyzed using RT-PCR. A forward primer (5'- 
TCAGAGGTGGAAATGAGAGGTTGG-3' ; SEQ ID NO: 864) and a 

20 reverse primer (5' -ACCGTATTGTCCACCTGGTGTTCT-3' ; SEQ ID 
NO: 865) - both derived from the open reading frame of 
AMLP1 - were used in standard RT-PCR. Sambrook et al . , 
Molecular cloning: 3rd edition, 2001. The cDNA emplates 
were obtained from brain, kidney, liver, testis, skeletal 

25 muscle, heart, bone marrow, lung, placenta, and prostate. 
The PCR reactions were carried out according to the 
following schedule: 94C, 20 seconds; 65C 20 seconds; 72C, 
60 seconds, for 35 cycles. PCR products were separated 
on an agarose gel and visualized with a Typhoon™ 

30 fluorimager and Imagequant software (Amersham 

Biosciences, Sunnyvale, CA, USA) . RT-PCR product for 
AMLP1 was found to be present most highly in brain, 
liver, kidney, and adrenal gland, but was also found at 
lower levels in the other tissues tested, notably 
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prostate, testis, lung, placenta, skeletal muscle, heart, 
bone marrow as well as colon tumor (FIG. 5) . 



EXAMPLE 4 

5 Production of Human AMLP1 Protein 

>\ 

The full length AMLP1 cDNA clone is cloned into 
the mammalian expression vector pcDNA3 . l/HISA 
(Invitrogen, Carlsbad, CA, USA) , transfected into COS7 
10 cells, transf ectants selected with G418, and protein 

expression in transf ectants confirmed by detection of the 

anti-Xpress™ epitope according to manufacturer's 
instructions. Protein is purified using immobilized 
metal affinity chromatography and vector-encoded protein 
15 sequence is then removed with enterokinase, per 

manufacturer's instructions, followed by gel filtration 
and/or HPLC. 

Following epitope tag removal, AMLP1 protein is 
present at a concentration of at least 70%, measured on a 

20 weight basis with respect to total protein (i.e., w/w) , 
and is free of acrylamide monomers, bis acrylamide 
monomers, polyacrylamide and ampholytes. Further HPLC 
purification provides AMLP1 protein at a concentration of 
at least 95%, measured on a weight basis with respect to 

25 total protein (i.e., w/w). 

EXAMPLE 5 

Production of Ant i -Human AMLP1 Antibody 

30 Purified proteins prepared as in Example 4 are 

conjugated to carrier proteins and used to prepare murine 
monoclonal antibodies by standard techniques. Initial 
screening with the unconjugated purified proteins, 
followed by competitive inhibition screening using 
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peptide fragments of the AMLP1 , identifies monoclonal 
antibodies with specificity for AMLP1 . 



EXAMPLE 6 

5 Use of Human AMLP1 Probes and Antibodies 

for Diagnosis of cancer 

After informed consent is obtained, samples are 
drawn from disease tissue or cells and tested for AMLP1 
10 mRNA levels by standard techniques and tested 

additionally for AMLP1 protein levels using anti- AMLP1 
antibodies in a standard ELISA. 



15 EXAMPLE 7 

Use of Human AMLP1 Nucleic Acids, 
Proteins, and Antibodies in Therapy 

Once over -expression of AMLP1 is detected in 
20 patients, AMLP1 antisense RNA or AMLPl-specif ic antibody 
is introduced into disease cells to reduce the amount of 
the protein. 

Once mutations of AMLP1 have been detected in 
patients, normal AMLP1 is reintroduced into the patient's 
25 disease cells by introduction of expression vectors that 
drive AMLP1 expression or by introducing AMLP1 proteins 
into cells. Antibodies for the mutated forms of AMLP1 are 
used to block the function of the abnormal forms of the 
protein. 



30 



EXAMPLE 8 
Human AMLP1 Disease Associations 



Diseases that map to the AMLP1 chromosomal 
35 region are shown in Table 4 . Mutations of AMLP1 might 
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lead to the disease (s) listed below. Alternatively, 
mutations of AMLP1 might lead to some other human 
disorder (s) as well. 



©as©©©©© 




OMIM No 


disease 


chromosomal 
location 


133780 


EXUDATIVE VITREORETINOPATHY 1 


Ilql3-q23 


213200 


CEREBELLAR ATAXIA 1 


Ilql4-q21 


603342 


SCHIZOPHRENIA 2 


Ilql4-q21 


603965 


FOCAL SEGMENTAL GLOMERULOSCLEROSIS 2 


Ilq21-q22 


165720 


OSTEOARTHRITIS 


llq 



5 



All patents, patent publications, and other 
published references mentioned herein are hereby 
incorporated by reference in their entireties as if each 

10 had been individually and specifically incorporated by 
reference herein. While preferred illustrative 
embodiments of the present invention are described, one 
skilled in the art will appreciate that the present 
invention can be practiced by other than the described 

15 embodiments, which are presented for purposes of 

illustration only and not by way of limitation. The 
present invention is limited only by the claims that 
follow. 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid that encodes a human 
angiomotin-like protein, comprising: 

(a) a nucleotide sequence selected from the group 
consisting of : 

(i) SEQ ID NO: 1, SEQ ID NO: 4; 

(ii) the complement of the sequences set forth 
in (i) ; 

(iii) the nucleotide sequence of SEQ ID NO: 2, 
SEQ ID NO: 5; 

(iv) a degenerate variant of the sequences set 
forth in (iii) ; and 

(v) the complement of the sequences set forth 
in (iii) and (iv) ; or 

(b) a nucleotide sequence selected from the group 
consisting of : 

(i) a nucleotide sequence that encodes a 
polypeptide having the sequence of SEQ ID 
NO: 3, SEQ ID NO: 6; 

(ii) a nucleotide sequence that encodes a 
polypeptide having the sequence of SEQ ID 
NO: 3, SEQ ID NO : 6, with conservative 
amino acid substitutions; and 

(iii) the complement of the sequences set forth 
in (i) and (ii) , 

wherein said isolated nucleic acid comprising a 
nucleotide sequence selected from group (b) is no 
more than about 100 kb in length. 

2. The isolated nucleic acid of claim 1 wherein said 
nucleic acid, or the complement of said nucleic 
acid, encodes a polypeptide as an adaptor protein 
that interacts with both angiostatin-like protein 
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and components of the actin cytoskeleton and has 
anti-angiogenesis activity. 



3. The isolated nucleic acid of claim 1, wherein said 
nucleic acid, or the complement of said nucleic 
acid, is expressed in brain, liver, kidney, adrenal 
gland, prostate, testis, lung, placenta, skeletal 
muscle, heart, bone marrow or colon tumor. 

4. A nucleic acid probe, comprising: 

(a) the nucleic acid of claim 1; or 

(b) at least 17 contiguous nucleotides of SEQ ID 
NO: 7, SEQ ID NO: 33; 

wherein said probe according to (b) is no longer 
than about 100 kb in length. 

5. The probe of claim 4, wherein said probe is 
detectably labeled. 

6. The probe of claim 4, attached to a substrate. 

7. A microarray, wherein at least one probe of said 
array is a probe according to claim 4. 

8. The isolated nucleic acid molecule of claim 1, 
wherein said nucleic acid molecule is operably 
linked to one or more expression control elements. 

9. A replicable vector comprising a nucleic acid 
molecule of claim 1. 

10. A replicable vector comprising an isolated nucleic 
acid molecule of claim 8. 

11. A host cell transformed to contain the nucleic acid 
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progeny thereof. 



PCT/US02/35129 
10, or the 



12. A method for producing a polypeptide, the method 
comprising: culturing the host cell of claim 11 
under conditions in which the protein encoded by 
said nucleic acid molecule is expressed. 

13. An isolated polypeptide produced by the method of 
claim 12. 

14. An isolated polypeptide, comprising: 

(a) an amino acid sequence of SEQ ID NO: 3, SEQ ID 
NO: 6; 

(b) an amino acid sequence having at least 65% 
amino acid sequence identity to that of (a) ; 

(c) an amino acid sequence according to (a) in 
which at least 95% of deviations from the 
sequence of (a) are conservative substitutions; 
or 

(d) a fragment of at least 8 contiguous amino acids 
of any of (a) - (c) . 

15. A fusion protein, said fusion protein comprising a 
polypeptide of claim 14 fused to a heterologous 
amino acid sequence. 

16. The fusion protein of claim 15, wherein said 
heterologous amino acid sequence is a detectable 
moiety. 

17. The fusion protein of claim 16, wherein said 
detectable moiety is fluorescent. 



18. 



The fusion protein of claim 15, wherein said 
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19. An isolated antibody, or antigen-binding fragment or 
derivative thereof, the binding of which can be 
competitively inhibited by a polypeptide of claim 

14 . 

20. A transgenic non-human animal modified to contain 
the nucleic acid molecule of any one of claims 1 or 
8 - 10. 

21. A transgenic non-human animal unable to express the 
endogenous orthologue of the nucleic acid molecule 
of claim 1. 

22. A method of identifying agents that modulate the 
expression of AMLP1, the method comprising: 

contacting a cell or tissue sample believed to 
express AMLP1 with a chemical or biological agent, 
and then 

comparing the amount of AMLP1 expression in 
said cell or tissue sample with that of a control, 
changes in the amount relative to control 
identifying an agent that modulates expression of 
AMLP1. 

23. A method of identifying agonists and antagonists of 
AMLP1, the method comprising: 

contacting a cell or tissue sample believed to 
express AMLP1 with a chemical or biological agent, 
and then 

comparing the activity of AMLP1 with that of a 
control, increased activity relative to a control 
identifying an agonist, decreased activity relative 
to a control identifying an antagonist. 

148 



WO 03/037931 



PCT/US02/35129 



24. A purified agonist of the polypeptide of claim 14. 

25. A purified antagonist of the polypeptide of claim 
14. 

26. A method of identifying a specific binding partner 
for a polypeptide according to claim 14, the method 
comprising: 

contacting said polypeptide to a potential 
binding partner; and 

determining if the potential binding partner 
binds to said polypeptide. 

27. The method of claim 26 , wherein said contacting is 
performed in vivo. 

28. A purified binding partner of the polypeptide of 
claim 14 . 

29. A method for detecting a target nucleic acid in a 
sample, said target being a nucleic acid according 
to claim 1, the method comprising: 

(a) hybridizing the sample with a probe comprising 
at least 17 contiguous nucleotides of a 
sequence complementary to said target nucleic 
acid in said sample under high stringency 
hybridization conditions, and 

(b) detecting the presence or absence, and 
optionally the amount, of said binding. 

30. A method of diagnosing a disease caused by mutation 
in AMLP1, comprising: 
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detecting said mutation in a sample of nucleic 
acids that derives from a subject suspected to have 
said disease. 

31. A method of diagnosing or monitoring a disease 
caused by altered expression of AMLP1, comprising: 

determining the level of expression of AMLP1 in 
a sample of nucleic acids or proteins that derives 
from a subject suspected to have said disease, 
alterations from a normal level of expression 
providing diagnostic and/or monitoring information. 

32. A diagnostic composition comprising the nucleic acid 
of claim 1, said nucleic acid being detectably 
labeled. 

33. The diagnostic composition of claim 32, wherein said 
composition is further suitable for in vivo 
administration . 

34. A diagnostic composition comprising the polypeptide 
of claim 14, said polypeptide being detectably 
labeled. 

35. The diagnostic composition of claim 34, wherein said 
composition is further suitable for in vivo 
administration . 

36. A diagnostic composition comprising the antibody, or 
antigen-binding fragment or derivative thereof, of 
claim 19. 

37. The diagnostic composition of claim 36, wherein said 
antibody or antigen-binding fragment or derivative 
thereof is detectably labeled. 
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38. The diagnostic composition of claim 37, wherein said 
composition is further suitable for in vivo 
administration . 

39. A pharmaceutical composition comprising the nucleic 
acid of claim 1 and a pharmaceutically acceptable 
excipient . 

40. A pharmaceutical composition comprising the 
polypeptide of claim 14 and a pharmaceutically 
acceptable excipient. 

41. A pharmaceutical composition comprising the antibody 
or antigen-binding fragment or derivative thereof of 
claim 19 and a pharmaceutically acceptable 
excipient . 

42. A pharmaceutical composition comprising the agonist 
of claim 24 and a pharmaceutically acceptable 
excipient . 

43. A pharmaceutical composition comprising the 
antagonist of claim 25 and a pharmaceutically 
acceptable excipient . 

44. A method for treating or preventing a disorder 
associated with decreased expression or activity of 
AMLP1, the method comprising administering to a 
subject in need of such treatment an effective 
amount of the pharmaceutical composition of any of 
claims 39, 40 or 42. 

45. A method for treating or preventing a disorder 
associated with increased expression or activity of 
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AMLP1, the method comprising administering to a 
subject in need of such treatment an effective 
amount of the pharmaceutical composition of claim 41 
or 43 . 

46. A method of modulating the expression of a nucleic 
N acid according to claim 1, the method comprising: 

administering an effective amount of an agent 
which modulates the expression of a nucleic acid 
according to claim 1 . 

47. A method of modulating at least one activity of a 
polypeptide according to claim 14, the method 
comprising: 

administering an effective amount of an agent 
which modulates at least one activity of a 
polypeptide according to claim 14. 
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AMLPla 

nt: SEQ ID NO: 1 
aa: SEQ ID NO: 3 

AGCCCCAGTAGTCCTGTCCAGGTTCTAGAAGACTCCACCTACTTTTCCCCAG 5 2 

ACTTTCAGCTCTATTCTGGGAGGCATGAAACATCTGCTTTGACGGTGGAGGC 104 

AACCAGTAGCATCAGGGAAAAAGTTGTTGAAGATCCTCTTTGTAACTTCCAC 156 

M R G S 4 

TCCCCAAACTTCCTGAGGATCTCAGAGGTGGAA ATG AGA GGT TCC 201 

EDAAAGTVLQRLI 17 

GAG GAT GCG GCA GCT GGA ACG GTA TTG CAG CGG CTG ATC 240 

QEQLRYGTPTENM 30 

CAG GAA CAA CTG CGG TAT GGC ACC CCA ACC GAG AAC ATG 279 

NLLAI QHQATGSA 43 

AAC TTG CTG GCC ATT CAG CAC CAG GCC ACA GGG AGT GCA 318 

GPAHPTNNFSSTE 56 

GGA CCA GCC CAT CCT ACA AAT AAC TTT TCT TCC ACG GAA 357 

NLTQEDPQMVYQS 69 

AAC CTC ACT CAA GAA GAC CCA CAA ATG GTC TAC CAG TCA 396 

ARQEPQGQEHQVD 82 

GCA CGC CAA GAA CCG CAG GGT CAA GAA CAC CAG GTG GAC 435 

NTVMEKQVRSTQP 95 

AAT ACG GTG ATG GAG AAA CAG GTC CGG TCC ACG CAG CCT 474 

QQNNEELPTYEEA 108 

CAG CAG AAC AAC GAG GAA CTG CCC ACT TAC GAG GAG GCC 513 
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KAQSQFFRGQQQQ 121 

AAA GCA CAG TCG CAG TTC TTC AGG GGG CAG CAG CAG CAA 552 

QQQGAVGHGYYMA 134 

CAG CAG CAG GGG GCG GTG GGC CAT GGT TAC TAC ATG GCA 591 

GGTSQKSRTEGRP 147 

GGG GGC ACC AGT CAG AAG TCC CGA ACT GAG GGG AGG CCC 630 

TVNRANSGQAHKD 160 

ACT GTG AAC CGT GCC AAC AGT GGA CAG GCG CAT AAG GAC 669 

EALKELKQGHIRS 173 

GAG GCG CTG AAG GAA CTG AAG CAG GGC CAC ATC CGC TCG 708 

LSERIM QLSLERN 186 

CTC AGC GAG AGA ATC ATG CAG CTG TCC CTG GAG AGG AAT 747 

GAKQHLPGSGNGK 199 

GGG GCC AAG CAA CAC CTT CCC GGC TCG GGG AAT GGA AAG 786 

DFKVGGGPPPAQP 212 

GAC TTC AAA GTA GGA GGG GGG CCC CCC CCT GCC CAG CCT 825 

AGKVLDPRGPPPE 225 

GCA GGT AAA GTG CTG GAC CCT CGG GGT CCT CCA CCT GAG 864 

YPFKTKQMMS PVS 238 

TAC CCC TTC AAG ACC AAG CAA ATG ATG TCC CCA GTC AGC 903 

KTQEHGLFYGDQH 251 

AAG ACC CAG GAG CAC GGA CTT TTT TAT GGT GAC CAG CAC 942 

PGMLHEMVKPY PA 264 

CCC GGG ATG CTC CAC GAG ATG GTC AAG CCC TAC CCT GCT 981 

PQPVRTDVAVLRY 277 

CCT CAG CCT GTG AGA ACA GAT GTG GCC GTC CTG CGG TAC 102 0 
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QPPPEYGVTSRPC 290 

CAG CCA CCC CCT GAG TAT GGG GTA ACG AGC CGC CCA TGC 1059 

QLPFPSTMQQHSP 303 

CAA CTT CCG TTC CCA TCA ACC ATG CAG CAG CAC AGC CCC 1098 

MSSQTSSASGPLH 316 

ATG TCC TCC CAG ACC TCT TCC GCC AGC GGG CCA CTG CAC 1137 

SVSLPLPLPMALG 329 

TCT GTC TCC CTG CCG CTT CCA CTC CCG ATG GCC CTG GGT 1176 

APQPPPAASPSQQ 342 

GCT CCA CAG CCC CCG CCT GCC GCC TCC CCC AGC CAG CAG 1215 

LGPDAFAIVERAQ 355 

CTT GGT CCA GAT GCC TTT GCG ATT GTG GAG CGA GCC CAG 1254 

QMVEI LTEENRVL 368 

CAA ATG GTG GAG ATA TTA ACA GAG GAG AAC CGG GTG CTT 12 93 

HQELQGYYDNADK 381 

CAC CAG GAA CTT CAG GGT TAC TAC GAC AAT GCC GAC AAG 1332 

LHKFEKELQRISE 394 

CTC CAC AAG TTT GAA AAA GAA CTT CAG AGA ATT TCG GAA 1371 

AYESLVKSTTKRE 407 

GCC TAT GAA AGT CTG GTC AAG TCT ACC ACC AAG CGA GAA 1410 

SLDKAMRNKLEGE 420 

TCG CTG GAC AAG GCC ATG AGA AAC AAA TTG GAA GGC GAG 1449 

IRRLHDFNRDLRD 433 

ATT AGA AGA CTT CAT GAT TTC AAC AGA GAC CTC CGA GAT 1488 

RLETANRQLSSRE 446 

CGA CTA GAG ACT GCT AAC AGG CAA CTA TCC AGC AGG GAA 152 7 
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YEGHGDKAAEGHY 459 

TAC GAA GGG CAT GGA GAC AAA GCT GCA GAG GGG CAT TAT 1566 

ASQNKEFLKEKEK 472 

GCT TCC CAG AAC AAA GAA TTC TTG AAG GAA AAG GAG AAA 1605 

LEMELAAVRTASE 485 

TTA GAA ATG GAG TTA GCA GCA GTG CGG ACT GCA AGT GAG 1644 

DHRRHI E ILDQAL 498 

GAC CAT CGG AGA CAC ATC GAG ATC CTG GAC CAG GCT TTG 1683 

SNAQARVIKLEEE 511 

AGC AAC GCC CAG GCC AGG GTC ATC AAG CTG GAA GAG GAG 1722 

LREKQAYVEKVEK 524 

TTA CGA GAG AAG CAA GCA TAT GTT GAG AAA GTT GAG AAG 1761 

LQQALTQLQSACE 537 

CTG CAG CAG GCC CTG ACC CAG CTG CAG TCT GCA TGT GAG 1800 

KREQMERRLRTWL 550 

AAG CGA GAA CAG ATG GAG CGG AGA CTG CGG ACT TGG CTG 1839 

ERELDALRTQQKH 563 

GAG AGA GAG CTG GAT GCA CTG AGA ACC CAG CAG AAA CAT 1878 

GNGQPANMPEYNA 576 

GGA AAT GGC CAG CCA GCC AAC ATG CCG GAA TAC AAT GCC 1917 

PALLELVREKEER 589 

CCA GCC CTC CTG GAA CTT GTG CGG GAG AAG GAG GAG CGG 1956 

ILALEADMTKWEQ 602 

ATC CTG GCC CTG GAG GCC GAC ATG ACA AAG TGG GAG CAG 1995 

KYLEESTIRHFAM 615 

AAG TAC CTG GAG GAG AGC ACC ATC CGA CAC TTT GCC ATG 2034 

NAAATAAAE RDTT 628 

AAT GCC GCA GCC ACT GCA GCA GCT GAG AGG GAC ACC ACG 2073 
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IINHSRNGSYGES 641 

ATC ATC AAC CAC TCA CGG AAT GGC AGC TAC GGA GAG AGC 2112 

SLEAHIWQEEEEV 654 

TCG CTG GAG GCC CAC ATC TGG CAA GAG GAG GAG GAG GTG 2151 

VQANRRCQDMEYT 667 

GTG CAG GCC AAC AGA AGG TGT CAG GAC ATG GAA TAC ACT 2190 

IKNLHAKI IEKDA 680 

ATT AAA AAT CTC CAT GCC AAA ATC ATA GAG AAA GAT GCT 2229 

MIKVLQQRSRKDA 693 

ATG ATT AAG GTC CTG CAG CAG CGA TCT CGT AAA GAT GCC 2268 

GKTDSSSLRPARS 706 

GGG AAG ACA GAC TCC TCC AGC CTA CGT CCT GCC CGC TCC 23 07 

VPS IAAAT GTHSR 719 

GTT CCA TCC ATA GCA GCA GCT ACT GGG ACA CAC TCT CGC 2346 

QTSLTSSQLAEEK 732 

CAG ACC TCT CTT ACC AGC AGC CAG CTG GCT GAG GAA AAG 2385 

KEEKTWKGS IGLL 745 

AAG GAA GAG AAG ACC TGG AAG GGG AGC ATA GGA TTG CTG 2424 

LGKEHHEHASAPL 758 

CTG GGG AAG GAG CAC CAT GAG CAT GCC TCT GCC CCA CTG 2463 

LPPPPTSALSSIA 771 

CTG CCA CCC CCA CCC ACC TCA GCA CTG TCC TCC ATA GCC 2502 

STTAASSAHAKTG 784 

TCC ACT ACG GCA GCC AGC AGT GCC CAC GCC AAG ACA GGC 2 541 

S'KDSSTQTDKSAE 797 

AGC AAG GAC AGC AGC ACA CAG ACT GAC AAG AGT GCC GAG 2580 

LFWPSMASLPSRG 810 

CTC TTC TGG CCC AGC ATG GCC TCC CTT CCC AGC CGC GGC 2619 
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RLSTTPAHSPVLK 823 

CGG CTG AGC ACG ACC CCT GCT CAC AGC CCC GTC CTG AAA 2658 

HPAAKGTAEKLEN 836 

CAC CCA GCG GCC AAA GGG ACC GCA GAG AAA CTG GAG AAC 2697 

SPGHGKSPDHRGR 849 

TCT CCT GGC CAT GGG AAG TCG CCT GAC CAC AGA GGC CGG 2736 

VSSLLHKPEFPDG 862 

GTC AGC AGC TTG CTG CAC AAG CCC GAG TTC CCT GAT GGA 2775 

EMMEVLI * 870 

GAG ATG ATG GAA GTC CTC ATC TAA CTGCCATCCCTGTGGAATT 2818 

TCAGTACAGAACACTGACAAACAAGGAAAGCGGCAGAGAAGGAAGAAAGACC 2870 

TAGAAGGTTGTAGATGGGAAATCAGGAATGATTTGAACTGATAAAGATTTCA 2922 

GACTCATAAGAACACATTTTATAAATGTTAAACACTU^AAACTACA^ 2974 

AGATAGAAGAGAATGCGATGGATTTTATTACACATGGTGGAAGAGAGAAGAG 3026 

GCGTGTAGGTTTGCAAACA/VAGTTAAGAAATAGGAAACTG/^ATTTTTCATTG 3078 

TACAGAAAATGTATCTCTTGGGGAGGGCCTGTGTACCCCCATTCTCTGATTA 313 0 

TAAACAGATAACCCTU^AAAAAAAAAAAAAAAAAAAAAAAAAAG 3173 
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nt: SEQ ID NO: 4 
aa: SEQ ID NO: 6 

AGCCCCAGTAGTCCTGTCCAGGTTCTAGAAGACTCCACCTACTTTTCCCCAG 52 

ACTTTCAGCTCTATTCTGGGAGGCATGAAACATCTGCTTTGACGGTGGAGGC 104 

AACCAGTAGCATCAGGGAAAAAGTTGTTGAAGATCCTCTTTGTAACTTCCAC 156 

M R G S 4 

TCCCCAAACTTCCTGAGGATCTCAGAGGTGGAA ATG AGA GGT TCC 201 

EDAAAGTVLQRL I 17 

GAG GAT GCG GCA GCT GGA ACA GTA TTG CAG CGG CTG ATC 240 

QEQLRYGTPTENM 30 

CAG GAA CAA CTG CGG TAT GGC ACC CCA ACC GAG AAC ATG 279 

NLLAIQHQATGSA 43 

AAC TTG CTG GCC ATT CAG CAC CAG GCC ACA GGG AGT GCA 318 

GPAHPTNNFSSTE 56 

GGA CCA GCC CAT CCT ACA AAC AAC TTT TCT TCC ACG GAA 357 

NLTQED PQMV YQS 69 

AAC CTC ACT CAA GAA GAC CCA CAA ATG GTC TAC CAG TCA 396 

ARQEPQGQEHQVD 82 

GCA CGC CAA GAA CCG CAG GGT CAA GAA CAC CAG GTG GAC 435 

NTVMEKQVRSTQP 95 

AAT ACG GTG ATG GAG AAA CAG GTC CGG TCC ACG CAG CCT 474 

QQNNEE LPTYEEA 108 

CAG CAG AAC AAC GAG GAA CTG CCC ACT TAC GAG GAG GCC 513 
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K A Q S Q 
AAA GCA CAG TCG CAG 

Q Q Q Q G 
CAA CAG CAG CAG GGG 

A G G T S 
GCA GGG GGC ACC AGT 

P T V N R 
CCC ACT GTG AAC CGT 

D E A L K 
GAC GAG GCG CTG AAG 

S L T E R 
TCG CTC ACC GAG AGA 

N G A K Q 
AAT GGG GCC AAG CAA 

K G F K V 
AAG GGC TTC AAA GTA 

P A G K V 
CCT GCA GGT AAA GTG 

E Y P F K 
GAG TAC CCC TTC AAG 

S K T Q E 
AGC AAG ACC CAG GAG 

H P G M L 
CAC CCC GGG ATG CTC 

A P Q P V 
GCT CCT CAG CCT GTG 



F F R G Q 
TTC TTC AGG GGG CAG 

A V G H G 
GCG GTG GGC CAT GGT 

Q K S R T 
CAG AAG TCC CGA ACT 

A N S G Q 
GCC AAC AGT GGA CAG 

D L K Q G 
GAT CTG AAG CAG GGC 

I M Q L S 
ATC ATG CAG CTG TCC 

H L P . G S 
CAC CTT CCC GGC TCG 

G G G P S 
GGA GGG GGG CCC TCC 

L D P R G 

CTG GAC CCT CGG GGT 

T K Q M M 
ACC AAG CAA ATG ATG 

H G L F Y 
CAC GGA CTT TTT TAT 

H E M V K 
CAC GAG ATG GTC AAG 

R T D V A 
AGA ACA GAT GTG GCC 



Q Q Q 121 

CAG CAG CAG 552 

Y Y M 134 
TAC TAC ATG 591 

E G R 147 

GAG GGG AGG 630 

A H K 160 

GCG CAT AAG 669 

H V R 173 

CAC GTC CGC 708 

L E R 186 

CTG GAG AGG 747 

G N G 199 

GGG AAT GGA 786 

P A Q 212 

CCT GCC CAG 825 

P P P 225 

CCT CCA CCT 864 

S P V 238 

TCC CCA GTC 903 

G D Q 251 

GGT GAC CAG 942 

P Y P 264 

CCC TAC CCT 981 

V L R 277 
GTC CTG CGG 1020 
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YQPPPEYGVTSRP 290 

TAC CAG CCA CCC CCT GAG TAT GGG GTA ACG AGC CGC CCA 1059 

CQLPFPSTMQQHS 303 

TGC CAA CTT CCG TTC CCA TCA ACC ATG CAG CAG CAC AGC 1098 

PMSSQTSSASGPL 316 

CCC ATG TCC TCC CAG ACC TCT TCC GCC AGC GGG CCA CTG 1137 

HSVSLPLPLPMAL 329 

CAC TCT GTC TCC CTG CCG CTT CCA CTC CCG ATG GCC CTG 1176 

GAPQPPPAASPSQ 342 

GGT GCT CCA CAG CCC CCG CCT GCC GCC TCC CCC AGC CAG 1215 

QLGPDAFAIVERA 355 

CAG CTT GGT CCA GAT GCC TTT GCG ATT GTG GAG CGA GCC 1254 

QQMVE I LTEENRV 368 

CAG CAA ATG GTG GAG ATA TTA ACA GAG GAG AAC CGG GTG 1293 

LHQELQGYYDNAD 381 

CTT CAC CAG GAA CTT CAG GGT TAC TAC GAC AAT GCC GAC 1332 

KLHKFEKELQRIS 394 

AAG CTC CAC AAG TTT GAA AAA GAA CTT CAG AGA ATT TCG 1371 

EAYESLVKSTTKR 407 

GAA GCC TAT GAA AGT CTG GTC AAG TCT ACC ACC AAG CGA 1410 

ESLDKAMRNKLEG 420 

GAA TCG CTG GAC AAG GCC ATG AGA AAC AAA TTG GAA GGC 1449 

EIRRLHDFNRDLR 433 

GAG ATT AGA AGA CTT CAT GAT TTC AAC AGA GAC CTC CGA 1488 

DRLETANRQLSSR 446 

GAT CGA CTA GAG ACT GCT AAC AGG CAA CTA TCC AGC AGG 1527 

EYEGHEDKAAEGH 459 

GAA TAC GAA GGG CAT GAA GAC AAA GCT GCA GAG GGG CAT 1566 
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YASQNKEFLKEKE 472 

TAT GCT TCC CAG AAC AAA GAA TTC TTG AAG GAA AAG GAG 1605 

ELEMELAAVRTAS 485 

GAA TTA GAA ATG GAG TTA GCA GCA GTG CGG ACT GCA AGT 1644 



EDHRRHIEILDQA 498 
GAG GAC CAT CGG AGA CAC ATC GAG ATC CTG GAC CAG GCT 1683 



LSNAQARVIKLEE 511 

TTG AGC AAC GCC CAG GCC AGG GTC ATC AAG CTG GAA GAG 1722 

ELREKQAYVEKVE 524 

GAG TTA CGA GAG AAG CAA GCA TAT GTT GAG AAA GTT GAG 1761 



KLQQALTQLQSAC 537 

AAG CTG CAG CAG GCC CTG ACC CAG CTG CAG TCT GCA TGT 1800 

EKREQMERRLRTW 550 

GAG AAG CGA GAA CAG ATG GAG CGG AGA CTG CGG ACT TGG 1839 



LERELDALRTQQ* 563 
CTG GAG AGA GAG CTG GAT GCA CTG AGA ACC CAG CAG TAG 1878 

TCCTATGAGAAAGAAGAGAGAAGGTTCTTGAAGAATGTTCTTTGAAAGGAAA 1930 



ATGAGGCACAGAAACATGGAAATGGCCAGCCAGCCAACATGCCGGAATACAA 1982 

TGCCCCAGCCCTCCTGGAACTTGTGCGGGAGAAGGAGGAGCGGATCCTGGCC 2034 

CTGGAGGCCGACATGACAAAGTGGGAGCAGAAGTACCTGGAGGAGAGCACCA 2086 

TCCGACACTTTGCCATGAATGCCGCAGCCACTGCAGCAGCTGAGAGGGACAC 2138 

CACGATCATCAACCACTCACGGAATGGCAGCTACGGAGAGAGCTCGCTGGAG 2190 

GCCCGCATCTGGCAAGAGGAGGAGGAGGTGGTGCAGGCCAACAGAAGGTGTC 2242 

AGGACATGGAATACACTATTAAAAATCTCCATGCCAAAATCATAGAGAAAGA 2294 

TGCTATGATTAAGGTCCTGCAGCAGCGATCTCGTAAAGATGCCGGGAAGACA 2346 
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GACTCCTCCAGCCTACGTCCTGCCCGCTCCGTTCCATCCATAGCAGCAGCTA 2398 

CTGGGACACACTCTCGCCAGACCTCTCTTACCAGCAGCCAGCTGGCTGAGGA 2450 

AAAGAAGGAAGAGAAGACCTGGAAGGGGAGCATAGGATTGCTGCTGGGGAAG 2502 

GAGCACCATGAGCATGCCTCTGCCCCACTGCTGCCACCCCCACCCACCTCAG 2554 

CACTGTCCTCCATAGCCTCCACTACGGCAGCCAGCAGTGCCCACGCCAAGAC 2606 

AGGCAGCAAGGACAGCAGCACACAGACTGACAAGAGTGCCGAGCTCTTCTGG 2658 

CCCAGCATGGCCTCCCTTCCCAGCCGCGGCCGGCTGAGCACGACCCCTGCTC 2710 

ACAGCCCCGTCCTGAAACACCCAGCGGCCAAAGGGACCGCAGAGA7VACTGGA 2762 

GAACTCTCCTGGCCATGGGAAGTCGCCTGACCACAGAGGCCGGGTCAGCAGC 2814 

TTGCTGCACAAGCCCGAGTTCCCTGATGGAGAGATGATGGAAGTCCTCATCT 2866 

AACTGCCATCCCTGTGGAATTTCAGTACAGAACACTGACAAACAAGGAAAGC 2 918 

GGCAGAGAAAGAAGAAAGACCTAGAAGGTTGTAGATGGGAAATCAGGAATGA 2 970 

TTTGAACTGATAAAGATTTCAGACTCATAAGAACACATTTTATAAATGTTAA 3022 

ACACAAAAACTACATGACTGAAGATAGAAGAGAATGCGATGGATTTTATTAC 3 074 

ACATGGTGGAAGAGAGAAGAGGCGTGTAGGTTTGCAAACAAAGTTAAGAAAT 3126 

AGGAAACTGAATTTTTCATTGTACAGAAAATGTATCTCTTGGGGAGGGCCTG 3178 

TGTACCCCCATTCTCTGGTTATAAACAGATAAACCCAAAAAAAAAAAAAAAA 3230 

AAAAAAAAAAAAAAAAAA 3248 
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